HUMAN PHOSPHATIDIC ACID PHOSPHATASE

Field of the Invention 

This invention relates to human phosphatidic acid 
phosphatase. More particularly, this invention relates 
to three variants of human phosphatidic acid phosphatase,
namely PAP-α (1 and 2), PAP-β and PAP-γ, and uses thereof.
The invention encompasses biotechnology inventions,
including biotechnology products and processes.

Background of the Invention 

Phosphatidic acid phosphatase (PAP) (also referred 
to in the art as phosphatidate phosphohydrolase) is known 
to be an important enzyme for glycerolipid biosynthesis. 
In particular, PAP catalyzes the conversion of 
phosphatidic acid (PA) (also referred to in the art as 
phosphatidate) into diacylglycerol (DAG). DAG is an
important branch point intermediate just downstream of PA
in the pathways for biosynthesis of
glycerophosphate-based phospholipids (Kent, Annu. Rev.
Biochem. 64: 315-343, 1995).

In eukaryotic cells, PA, the precursor molecule for 
all glycerophospholipids, is converted either to CDP- 
diacylglycerol (CDP-DAG) by CDP-DAG synthase (CDS) or to 
DAG by phosphatidic acid phosphatase (PAP). In mammalian
cells, CDP-DAG is the precursor to phosphatidylinositol
(PI), phosphatidylglycerol (PG), and cardiolipin (CL),
whereas diacylglycerol is the precursor to
triacylglycerol (TG), phosphatidylethanolamine (PE), and
phosphatidylcholine (PC) in all eukaryotic cells.
Therefore, the partitioning of phosphatidic acid between
CDP-diacylglycerol and diacylglycerol is an important
regulatory point in eukaryotic phospholipid metabolism 
(Shen et al., J. Biol. Chem. 271: 789-795, 1996). 

In addition to being an important enzyme for 
glycerolipid biosynthesis, PAP is also an important 
enzyme for signal transduction. PAP catalyzes the
dephosphorylation of PA to DAG. DAG is a well-studied
lipid second messenger which is essential for the
activation of protein kinase C (Kent, Annu. Rev. Biochem.
64: 315-343, 1995), whereas PA itself is also a lipid
messenger implicated in various signaling pathways such
as NADPH oxidase activation and calcium mobilization
(English, Cell Signal. 8: 341-347, 1996). The regulation
of PAP activity can therefore affect the balance of
divergent signaling processes that the cell receives in
terms of PA and DAG (Brindley et al., Chem. Phys. Lipids
80: 45-57, 1996).

Various forms of PAP have been isolated from porcine
(Kai et al., J. Biol. Chem. 271: 18931-18938, 1996) and
rat species (Brindley et al., Chem. Phys. Lipids 80:
45-57, 1996). Furthermore, the putative amino acid sequence
of murine PAP has been identified (Kai et al., supra).
Prior to the instant invention, however, human PAP had 
not been identified or isolated. 

Genes coding for PAP have been identified in E. coli 
(Dillon et al., J. Biol. Chem. 260: 12078-12083, 1985) and
in mouse (Kai et al., J. Biol. Chem. 271: 18931-18938,
1996). Furthermore, the following GenBank human cDNA
clones are available: accession nos. H17855, N75714, and
W70040. No uses were known, however, for these
polynucleotide sequences.

Accordingly, there is a need for the identification 
and isolation of human PAP and for methods of using human
PAP, for example, for the dephosphorylation of a
substrate.

Summary of the Invention 

It is therefore an object of the present invention
to provide polynucleotide sequences encoding three or
more variants of human PAP, namely PAP-α (1 and 2), PAP-β
and PAP-γ.

It is a further object to provide the isolated
protein of these three variants.

It is yet a further object to provide a
biotechnology method for preparing these variants via
recombinant methods.

It is a further object to provide a biotechnology
method of using these variants or human PAP in general to
synthesize DAG.

In accomplishing these and other objects there is 
provided an isolated polynucleotide encoding human 
phosphatidic acid phosphatase wherein the polynucleotide 
encodes a protein comprising a polypeptide sequence 
selected from the group consisting of (i) the sequence at 
amino acid number 1 to amino acid number 284 (SEQ ID
NO:2) in Figure 1, (ii) the sequence at amino acid number
1 to amino acid number 285 (SEQ ID NO:4) in Figure 2, and
(iii) the sequence at amino acid number 1 to amino acid
number 276 (SEQ ID NO:8) in Figure 4.

There is further provided an isolated human
phosphatidic acid phosphatase protein, wherein the
protein comprises a polypeptide sequence selected from
the group consisting of (i) the sequence at amino acid
number 1 to amino acid number 284 (SEQ ID NO:2) in Figure
1, (ii) the sequence at amino acid number 1 to amino acid
number 285 (SEQ ID NO:4) in Figure 2, and (iii) the
sequence at amino acid number 1 to amino acid number 276
(SEQ ID NO:8) in Figure 4.

There is further provided a method of preparing a
human phosphatidic acid phosphatase-β protein comprising
the steps of (i) transforming a host cell with an
expression vector comprising a polynucleotide encoding
human phosphatidic acid phosphatase, (ii) culturing the
transformed host cells which express the protein and
(iii) isolating the protein.

There is further provided a method of
dephosphorylating a substrate comprising contacting the
substrate with an effective amount of isolated human
phosphatidic acid phosphatase protein such that the
protein catalyzes the dephosphorylation of the substrate.
It is further provided that the substrate of this method
is selected from the group consisting of phosphatidic
acid, lysophosphatidic acid, ceramide 1-phosphate, and
sphingosine 1-phosphate. It is further provided that
this method occurs in vitro, and comprises a step of
isolating the dephosphorylated substrate. Additionally,
the method can occur in vivo, and is effected by the
administration of human phosphatidic acid phosphatase to
a mammal in need thereof.

Brief Description of the Drawings 

Figure 1 shows the DNA sequence of the cDNA insert
of the human PAP-α1 isolated herein and the corresponding
amino acid sequence (SEQ ID NOS:1 and 2).

Figure 2 shows the DNA sequence of the cDNA insert
of the human PAP-α2 isolated herein and the corresponding
amino acid sequence (SEQ ID NOS:3 and 4).

Figure 3 shows the DNA sequence of the cDNA insert
of the human PAP-β isolated herein and the corresponding
amino acid sequence (SEQ ID NOS:5 and 6).

Figure 4 shows the DNA sequence of the cDNA insert
of the human PAP-γ isolated herein and the corresponding
amino acid sequence (SEQ ID NOS:7 and 8).

Figure 5 shows an amino acid sequence alignment of the
murine PAP coding sequence and the coding sequences for
human PAP-α (1 and 2), PAP-β and PAP-γ (SEQ ID NOS:9-13).

Figure 6 shows the effect of IL-10 on PAP-β
expression in human endothelial ECV304 cells using
Northern blot analysis.

Figure 7 depicts a thin layer chromatography
analysis demonstrating the increase in PA
dephosphorylation in cells transfected with either the
PAP-α1 or PAP-α2 cDNA expression plasmids.

Figure 8 shows the differential expression of PAP-α
mRNA in various tumor versus normal tissues.

Figure 9 is a schematic representation of
glycerophospholipid biosynthesis involving the conversion
of PA to either DAG or CDP-DAG. The conversion of PA to
DAG involves the PAP enzyme, while the conversion of PA to
CDP-DAG involves the CDS enzyme.

Detailed Description of Preferred Embodiments

This invention relates to isolated human
phosphatidic acid phosphatase. More particularly, this
invention relates to three variants of human phosphatidic
acid phosphatase, namely PAP-α (1 and 2), PAP-β and PAP-γ.
Examples of the uses for human PAP include the
following. PAP is an important tool for enzymatic
catalysis of several biologically significant proteins.
As discussed above, PAP catalyzes the dephosphorylation
of PA to DAG. DAG, in turn, is essential for the
activation of protein kinase C (Kent, Annu. Rev. Biochem.
64: 315-343, 1995).

Moreover, PAP catalyzes the dephosphorylation of
lysophosphatidic acid (LPA), ceramide 1-phosphate
(C-1-P), and sphingosine 1-phosphate (S-1-P) (Brindley et
al., Chem. Phys. Lipids 80: 45-57, 1996). In the case of
LPA, S-1-P, and C-1-P, the products of the PAP reaction are
monoacylglycerol, sphingosine, and ceramide,
respectively. PAP can control the balance of a wide
spectrum of lipid mediators of cell activation and signal
transduction by modulating the phosphorylated state of
these lipids.

Additionally, the human PAPs of the present invention
are likely to define a new family of tumor suppressor
genes that can be used as candidate genes for gene
therapy for the treatment of certain tumors. The
relationship between PAP and tumor suppression is evidenced
by findings that PAP activity is lower in fibroblast cell
lines transformed with either the ras or fps oncogene
than in the parental rat1 cell line (Brindley et al.,
Chem. Phys. Lipids 80: 45-57, 1996). The decrease in PAP
activity in transformed cells correlates with a
concomitant increase in PA concentration. Moreover,
elevated PAP activity and a lower level of PA have been
observed in contact-inhibited fibroblasts relative to
proliferating and transformed fibroblasts (Brindley et
al., Chem. Phys. Lipids 80: 45-57, 1996). Therefore, PAP
plays a role in decreasing cell division and as such can
provide a useful tool in treating cancer.

Additionally, PA, the substrate for the enzyme PAP,
has been implicated in cytokine-induced inflammatory
responses (Bursten et al., Circ. Shock 44: 14-29, 1994;
Abraham et al., J. Exp. Med. 181: 569-575, 1995; Rice et
al., Proc. Natl. Acad. Sci. USA 91: 3857-3861, 1994; Leung
et al., Proc. Natl. Acad. Sci. USA 92: 4813-4817, 1995)
and the modulation of numerous protein kinases involved
in signal transduction (English et al., Chem. Phys.
Lipids 80: 117-132, 1996). Because of the possibility
that activation of human PAP expression can
counterbalance the inflammatory response from cytokine
stimulation through degradation of excess amounts of PA in
cells, the genes encoding human PAP can be used in gene
therapy for the treatment of inflammatory diseases.

Human PAP described herein can also be used in gene
therapy for the treatment of obesity associated with
diabetes. PAP activity is decreased in the livers and
hearts of the grossly obese and insulin-resistant JCR:LA
corpulent rat compared to the control lean phenotype
(Brindley et al., Chem. Phys. Lipids 80: 45-57, 1996).
Human PAP described herein therefore can provide an
important tool for the treatment of obesity associated
with diabetes.

1. Human PAP 

As used herein, "phosphatidic acid phosphatase" or 
"PAP" refers to a protein capable of catalyzing the 
dephosphorylation of PA to DAG. PAP also includes 
proteins capable of catalyzing the dephosphorylation of 
lysophosphatidic acid (LPA), ceramide 1-phosphate
(C-1-P), and sphingosine 1-phosphate (S-1-P).

As used herein, "isolated" PAP denotes a degree of 
separation of the protein from other materials endogenous 
to the host organism. As used herein, "purified" denotes 
a higher degree of separation than isolated. A purified 
protein is sufficiently free of other materials 
endogenous to the host organism such that any remaining 
materials do not adversely affect the biological
properties of the protein. For example, a purified
protein is one sufficiently pure to be used in a
pharmaceutical context.

As used herein, "human" PAP refers to PAP naturally 
occurring (or "native") in the human species, including 
natural variations due to allelic differences. The term 
"human PAP," however, is not limited to native human 
proteins, but also includes amino acid sequence variants 
of native human PAP that demonstrate PAP activity, as
defined above.

Variants often exhibit the same qualitative
biological activity as the naturally-occurring analogue,
although variants also are selected in order to modify
the characteristics of PAP protein. In a preferred
embodiment, therefore, human PAP includes the amino acid
sequences of Figures 1-4 (SEQ ID NOS:2, 4, 6 and 8),
being PAP-α1, PAP-α2, PAP-β and PAP-γ, respectively, and
variants thereof.

Amino acid sequence variants of the protein can be 
substitutional, insertional or deletion variants. 
Deletion variants lack one or more residues of the native
protein which are not essential for biological activity.
An example of a common deletion variant is a protein
lacking transmembrane sequences. Another example is a
protein lacking secretory signal sequences or signal
sequences directing the protein to bind to a particular
part of a cell.

Substitutional variants typically contain the
exchange of one amino acid for another at one or more
sites within the protein, and are designed to modulate
one or more properties of the protein such as stability
against proteolytic cleavage. Substitutions preferably
are conservative, that is, one amino acid is replaced
with one of similar shape and charge. Conservative
substitutions are well known in the art and include, for
example, the changes of: alanine to serine; arginine to
lysine; asparagine to glutamine or histidine; aspartate
to glutamate; cysteine to serine; glutamine to
asparagine; glutamate to aspartate; glycine to proline;
histidine to asparagine or glutamine; isoleucine to
leucine or valine; leucine to valine or isoleucine;
lysine to arginine, glutamine, or glutamate; methionine
to leucine or isoleucine; phenylalanine to tyrosine,
leucine or methionine; serine to threonine; threonine to
serine; tryptophan to tyrosine; tyrosine to tryptophan
or phenylalanine; and valine to isoleucine or leucine.
Of course, other amino acid substitutions can be
undertaken.
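
By way of illustration only, the conservative substitutions listed
above can be written as a lookup table and used to classify the
differences between a native sequence and a candidate variant. The
following is a minimal sketch: the function names and the short
example sequences are hypothetical, and the table contains only the
exchanges recited in the text, not every substitution that a skilled
artisan might regard as conservative.

```python
# Conservative substitutions as recited in the text, keyed by
# one-letter amino acid code (original residue -> permitted replacements).
CONSERVATIVE = {
    "A": {"S"}, "R": {"K"}, "N": {"Q", "H"}, "D": {"E"}, "C": {"S"},
    "Q": {"N"}, "E": {"D"}, "G": {"P"}, "H": {"N", "Q"},
    "I": {"L", "V"}, "L": {"V", "I"}, "K": {"R", "Q", "E"},
    "M": {"L", "I"}, "F": {"Y", "L", "M"}, "S": {"T"}, "T": {"S"},
    "W": {"Y"}, "Y": {"W", "F"}, "V": {"I", "L"},
}


def is_conservative(original, replacement):
    """True if the exchange appears in the recited list."""
    return replacement in CONSERVATIVE.get(original, set())


def classify_substitutions(native, variant):
    """List (position, native residue, variant residue, conservative?)
    for every position at which two equal-length sequences differ.
    Positions are 1-based, as in the sequence listings."""
    if len(native) != len(variant):
        raise ValueError("sequences must have equal length")
    return [
        (i + 1, a, b, is_conservative(a, b))
        for i, (a, b) in enumerate(zip(native, variant))
        if a != b
    ]


if __name__ == "__main__":
    # Hypothetical five-residue sequences, not taken from the figures.
    for row in classify_substitutions("MFDKT", "MYDRW"):
        print(row)
```

A table of this kind only screens substitutions against the recited
list; whether a given variant retains PAP activity would still have
to be determined experimentally.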

Insertional variants include fusion proteins such
as those used to allow rapid purification of the protein
and also can include hybrid proteins containing sequences
from other proteins and polypeptides which are protein
homologues.

Variants of human PAP also include fragments,
analogs, derivatives, muteins and mimetics of the natural
PAP protein that retain the ability to cause the
beneficial results described above. Fragments of the
human PAP protein refer to portions of the amino acid
sequence of the PAP polypeptide that also retain this
ability.

Variants can be generated directly from the human
PAP protein itself by chemical modification, by
proteolytic enzyme digestion, or by combinations thereof.
Additionally, methods of synthesizing polypeptides
directly from amino acid residues also exist.

Non-peptide compounds that mimic the binding and
function of the human PAP protein ("mimetics") can be
produced by the approach outlined in Saragovi et al.,
Science 253: 792-95 (1991). Mimetics are peptide-
containing molecules which mimic elements of protein
secondary structure. See, for example, Johnson et
al., "Peptide Turn Mimetics" in BIOTECHNOLOGY AND
PHARMACY, Pezzuto et al., Eds. (Chapman and Hall, New
York, 1993).

The underlying rationale behind the use of peptide
mimetics is that the peptide backbone of proteins exists
chiefly to orient amino acid side chains in such a way
as to facilitate molecular interactions. For the purposes
of the present invention, appropriate mimetics can be
considered to be the equivalent of the human PAP protein
itself.

More typically, at least in the case of gene
therapy, variants are created by recombinant techniques
employing genomic or cDNA cloning methods. Site-specific
and region-directed mutagenesis techniques can be
employed. See CURRENT PROTOCOLS IN MOLECULAR BIOLOGY
vol. 1, ch. 8 (Ausubel et al. eds., J. Wiley & Sons 1989
& Supp. 1990-93); PROTEIN ENGINEERING (Oxender & Fox
eds., A. Liss, Inc. 1987). In addition, linker-scanning
and PCR-mediated techniques can be employed for
mutagenesis. See PCR TECHNOLOGY (Erlich ed., Stockton
Press 1989); CURRENT PROTOCOLS IN MOLECULAR BIOLOGY,
vols. 1 & 2, supra. Protein sequencing, structure and
modeling approaches for use with any of the above
techniques are disclosed in PROTEIN ENGINEERING, loc.
cit., and CURRENT PROTOCOLS IN MOLECULAR BIOLOGY, vols. 1
& 2, supra.

2. Polynucleotides Encoding Human PAP

The present invention further includes isolated
polynucleotides encoding human phosphatidic acid
phosphatase. As used herein, an "isolated"
polynucleotide denotes a degree of separation of the
polynucleotide from its naturally occurring environment,
e.g., from its native intact genome. In a preferred
embodiment, the isolated polynucleotides correspond to
those shown in Figure 1 at nucleotide number 342 to
nucleotide number 1193 of SEQ ID NO:1; Figure 2 at
nucleotide number 342 to nucleotide number 1196 of SEQ ID
NO:3; Figure 3 at nucleotide number 294 to nucleotide
number 1226 of SEQ ID NO:5; and Figure 4 at nucleotide
number 4 to nucleotide number 833 of SEQ ID NO:7.
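
As a simple arithmetic cross-check of the ranges recited above, the
inclusive length of each range can be converted into whole codons.
The sketch below is illustrative only; the dictionary and function
names are hypothetical, and the coordinates are copied from the
preceding paragraph without reference to the figures themselves.

```python
# Nucleotide ranges (inclusive, 1-based) as recited in the text.
CODING_RANGES = {
    "SEQ ID NO:1": (342, 1193),
    "SEQ ID NO:3": (342, 1196),
    "SEQ ID NO:5": (294, 1226),
    "SEQ ID NO:7": (4, 833),
}


def span_summary(start, end):
    """Return (length in nucleotides, whole codons, leftover nucleotides)
    for an inclusive nucleotide range."""
    length = end - start + 1
    return length, length // 3, length % 3


if __name__ == "__main__":
    for name, (start, end) in CODING_RANGES.items():
        length, codons, leftover = span_summary(start, end)
        print(f"{name}: {length} nt, {codons} codons, {leftover} nt left over")
```

Under this count the first two ranges correspond to 284 and 285
codons, matching the protein lengths recited for SEQ ID NO:2 and SEQ
ID NO:4, and the third range corresponds to 311 codons. The fourth
range, as recited, spans 830 nucleotides, that is, 276 whole codons
plus two nucleotides; the text does not account for the remainder.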

The invention furthermore relates to a
polynucleotide whose sequence is degenerate with respect
to the sequences mentioned above in accordance with the
nature of the genetic code. Degeneracy is often referred
to as codon/anticodon wobble, and is discussed in Watson
et al., MOLECULAR BIOLOGY OF THE GENE (4th ed. 1987) at
437-43.
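
Degeneracy can be illustrated with a short sketch that translates
two different nucleotide sequences into the same peptide using the
standard genetic code. The two DNA strings below are hypothetical
examples chosen for illustration and are not taken from the
sequence listings; the function names are likewise assumptions.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG ordering
# (first base varies slowest, third base fastest). "*" marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA coding sequence (length divisible by 3)."""
    dna = dna.upper()
    if len(dna) % 3:
        raise ValueError("sequence length must be a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def synonymous_codons(amino_acid):
    """All codons that encode the given one-letter amino acid."""
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


if __name__ == "__main__":
    # Two distinct, hypothetical sequences encoding the same peptide.
    print(translate("ATGTTTGACAAGACG"))
    print(translate("ATGTTCGATAAAACC"))
    print(synonymous_codons("L"))
```

Because most amino acids are specified by more than one codon, many
distinct polynucleotides encode any given polypeptide, which is why
the degenerate sequences are treated together with the sequences of
the figures.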

The present invention further includes bases,
nucleosides, nucleotides, and oligonucleotides derived from
the isolated polynucleotides of the present invention.
The term "derived" when used in the context of the
present invention connotes a degree of similarity that
is sufficient to indicate the original polynucleotide
from which hybrid forms, or portions thereof, were
obtained. Also within the scope of the invention are so-
called "polyamide" or "peptide" nucleic acids ("PNAs")
derived from the polynucleotides of the present
invention. PNAs are constructed by replacing the
(deoxy)ribose phosphate backbone of a subject
polynucleotide with an achiral polyamide backbone or the
like. See Nielsen et al., Science 254: 1497-54 (1991).

The above polynucleotides and derivatives thereof
can be used as important tools in recombinant DNA and
other protocols involving nucleic acid hybridization
techniques. More specifically, oligonucleotides and
nucleic acids derived from the isolated polynucleotides
shown in Figures 1-4 (SEQ ID NOS:1, 3, 5, and 7) can be
used as hybridization probes, capable of recognizing and
specifically binding to complementary nucleic acid
sequences, providing thereby a means of detecting,
identifying, locating and measuring complementary nucleic
acid sequences in a biological sample.

Biological samples include, among a great many 
others, blood or blood serum, lymph, ascites fluid, 
urine, microorganism or tissue culture medium, cell 
extracts, or the like, derived from a biological source, 
or a solution containing chemically synthesized protein,
or an extract or solution prepared from such fluid from
a biological source.

An oligonucleotide containing a modified nucleotide 
of the invention can be used as a primer to initiate 
nucleic acid synthesis at locations in a DNA or RNA 
molecule comprising the sequence complementary to the
oligonucleotide sequence. The synthesized nucleic acid
strand would have incorporated, at its 5' terminus, the
oligonucleotide primer bearing the invention and would,
therefore, be detectable by exploitation of the
characteristics of the detectable label. Two such
primers, specific for different nucleotide sequences on
complementary strands of dsDNA, can be used in the
polymerase chain reaction (PCR) to synthesize and amplify 
the amount of a nucleotide sequence. The detectable 
label present on the primers will facilitate the 
identification of desired PCR products. PCR, combined 
with techniques for preparing complementary DNA (cDNA) 
can be used to amplify various RNAs, with oligonucleotide 
primers again serving both to provide points for 
initiation of synthesis in the cDNA duplex flanking the 
desired sequence and to identify the desired product. 
Primers labeled with the invention may also be utilized 
for enzymatic nucleic acid sequencing by the dideoxy
chain-termination technique.

The invention can be applied to measure or
quantitate the amount of DNA present in a sample. For
instance, the concentration of nucleic acid can be
measured by comparing the concentration of detectable
labels incorporated into the unknown nucleic acid with the
concentration of detectable labels incorporated into
known amounts of nucleic acid.

Such a comparative assessment can be done using 
biotin where the respective concentrations are determined 
by an enzyme-linked assay utilizing the streptavidin- 
alkaline phosphatase conjugate and a substrate yielding 
a soluble chromogenic or chemiluminescent signal.
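
The comparative assessment described above is commonly carried out
with a standard curve. The following is a minimal sketch, assuming
that the measured signal is linear in the amount of labeled nucleic
acid over the range of the standards; the numerical values in the
usage example are hypothetical and do not represent measured data.

```python
def fit_line(amounts, signals):
    """Ordinary least-squares fit of signal = slope * amount + intercept
    through a set of standards of known amount."""
    n = len(amounts)
    if n < 2 or n != len(signals):
        raise ValueError("need at least two paired standards")
    mean_x = sum(amounts) / n
    mean_y = sum(signals) / n
    sxx = sum((x - mean_x) ** 2 for x in amounts)
    if sxx == 0:
        raise ValueError("standards must span more than one amount")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(amounts, signals))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def estimate_amount(signal, slope, intercept):
    """Invert the standard curve to estimate an unknown amount."""
    if slope == 0:
        raise ValueError("standard curve has zero slope")
    return (signal - intercept) / slope


if __name__ == "__main__":
    # Hypothetical standards: known amounts and their label signals.
    slope, intercept = fit_line([0.0, 1.0, 2.0, 4.0], [1.0, 3.0, 5.0, 9.0])
    print(estimate_amount(7.0, slope, intercept))
```

Signals falling outside the range covered by the standards would
call for dilution or additional standards rather than extrapolation.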

3. Recombinant Production of Human PAP

In a further embodiment human PAP is expressed via
recombinant methods known to those of skill in the art.
The polynucleotides of the present invention can be



SUBSTITUTE SHEET (RULE 26) 



WO 98/46730 






PCT/US98/07928 






expressed in any number of different recombinant DNA 
expression systems to generate large amounts of protein, 
which can then be purified and used for the various 
applications of human PAP described above. Included 
within the present invention are proteins having native
glycosylation sequences, and deglycosylated or
unglycosylated proteins prepared by the methods described
below.

Recombinant technology for producing desired
proteins is known by ordinarily skilled artisans and
includes providing a coding sequence for a desired
protein, and operably linking the coding sequence to 
polynucleotide sequences capable of effecting its 
expression. 

With regard to one aspect of the invention, it often 
is desirable to produce human PAP as a fusion protein, 
freed from upstream, downstream or intermediate 
sequences, or as a protein linked to leader sequences, 
effecting secretion of human PAP into cell culture
medium.

A typical expression system will also contain
control sequences necessary for transcription and
translation of a message. Known control sequences
include constitutive or inducible promoter systems,
translational initiation signals (in eucaryotic
expression), polyadenylation translation termination
sites, and transcription terminating sequences.
Expression vectors containing controls which permit
operable linking of desired coding sequences to required
control systems are known by the skilled artisan. Such
vectors can be found which are operable in a variety of
hosts.

Human PAP of the present invention may be produced
in procaryotic cells using appropriate controls, such as
trp or lac promoters, or in eucaryotic host cells,
capable of effecting post-translational processing that
permits proteins to assume desired three-dimensional
conformation. Eucaryotic control systems and expression
vectors are known, including leu and glycolytic promoters
useful in yeast, the viral SV40 and adenovirus and CMV
promoters in mammalian cells, and the baculovirus system
which is operable in insect cells. Plant vectors with
suitable promoters, such as the nos promoter, are also
available.

Standard laboratory manuals (e.g., Sambrook et al.,
MOLECULAR CLONING: A LABORATORY MANUAL, Second Edition
(Cold Spring Harbor Laboratory Press, Cold Spring Harbor,
NY 1989)) present standard techniques and methodologies
for expressing polynucleotides encoding a desired
protein, culturing appropriate cells, providing suitable
expression conditions, and recovering a resulting protein
from culture.

In preparing the inventive human PAP, a suitable 
polynucleotide encoding human PAP, constructed utilizing
any of the foregoing techniques, is operably linked to an
expression vector which is then transformed into a
compatible host. Host cells are cultured using 
conditions appropriate for growth. Expression of the 
desired human PAP is preferably induced after some 
predetermined growth level has occurred. Human PAP 
production is monitored and the desired protein isolated 
from culture either from a supernatant, or by first 
lysing host cells with an appropriate agent, or by other 
methods known to the skilled artisan. 

In another preferred embodiment, a polynucleotide 
encoding human PAP is ligated into a mammalian expression 
vector. A preferred mammalian expression vector is the 
plasmid "pCE2." The plasmid pCE2 is derived from pREP7b
(Leung et al., Proc. Natl. Acad. Sci. USA 92: 4813-
4817, 1995) with the RSV promoter region replaced by the
CMV enhancer and the elongation factor-1α (EF-1α)
promoter and intron. The CMV enhancer of the pCE2 vector
is constructed from a 380 bp Xba I-Sph I fragment
produced by PCR from pCEP4 (Invitrogen, San Diego, CA)
using the primers 5'-GGCTCTAGAT ATTAATAGTA ATCAATTAC-3'
(SEQ ID NO:14) and 5'-CCTCACGCAT GCACCATGGT AATAGC-3'
(SEQ ID NO:15). The EF-1α promoter and intron (Uetsuki
et al., J. Biol. Chem. 264: 5791-5798, 1989) are
constructed from a 1200 bp Sph I-Asp718 I fragment
produced by PCR from human genomic DNA using the primers
5'-GGTGCATGCG TGAGGCTCCG GTGC-3' (SEQ ID NO:16) and 5'-
GTAGTTTTCA CGGTACCTGA AATGGAAG-3' (SEQ ID NO:17). These
two fragments are ligated into a Xba I/Asp718 I digested
vector derived from pREP7b to generate pCE2.
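
The cloning strategy above relies on each primer carrying the
recognition site of the enzyme used to cut its end of the PCR
fragment. The sketch below checks this for the four primers as
transcribed from the text. The recognition sequences (TCTAGA for
Xba I, GCATGC for Sph I, GGTACC for Asp718 I) are the standard ones
for these enzymes and are not stated in the text; the pairing of
each primer with an enzyme, and the function names, are assumptions
made for illustration.

```python
# Standard recognition sequences (5'->3') for the enzymes named in the text.
RECOGNITION_SITES = {
    "Xba I": "TCTAGA",
    "Sph I": "GCATGC",
    "Asp718 I": "GGTACC",
}

# Primers transcribed from the text with spacing removed, each paired
# with the enzyme whose site it is assumed to introduce.
PRIMERS = {
    "SEQ ID NO:14": ("GGCTCTAGATATTAATAGTAATCAATTAC", "Xba I"),
    "SEQ ID NO:15": ("CCTCACGCATGCACCATGGTAATAGC", "Sph I"),
    "SEQ ID NO:16": ("GGTGCATGCGTGAGGCTCCGGTGC", "Sph I"),
    "SEQ ID NO:17": ("GTAGTTTTCACGGTACCTGAAATGGAAG", "Asp718 I"),
}


def site_position(primer, enzyme):
    """0-based index of the enzyme's site in the primer, or -1 if absent."""
    return primer.upper().find(RECOGNITION_SITES[enzyme])


def check_primers(primers=PRIMERS):
    """Map each primer name to (primer length, index of its expected site)."""
    return {
        name: (len(seq), site_position(seq, enzyme))
        for name, (seq, enzyme) in primers.items()
    }


if __name__ == "__main__":
    for name, (length, position) in check_primers().items():
        print(name, length, position)
```

A result of -1 for any primer would indicate either a transcription
error in the sequence or a mismatch between the primer and the
enzyme assumed for it.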

In another preferred embodiment of the present 
invention, pCE2 containing a polynucleotide expressing 
human PAP is used to transform a host cell which then 
expresses the protein. Preferred host cells include the 
human embryonic kidney cell line 293-EBNA (Invitrogen, 
San Diego, CA), endothelial ECV304 cells, and epithelial
A549 cells. 

4. Dephosphorylation of Substrate

In another embodiment, the present invention 
includes a method of dephosphorylating a substrate by 
contacting the substrate with an effective amount of 
isolated human PAP. An "effective amount" of human PAP 
is an amount which will dephosphorylate a detectable 
amount of substrate. Such an amount can be determined 
empirically based on variables well known to those of 
skill in the art, such as reaction time and temperature. 

In one embodiment, the substrate includes 
phosphatidic acid, lysophosphatidic acid, ceramide 1- 
phosphate, and sphingosine 1-phosphate. In another 
embodiment, the isolated human PAP includes PAP-α (1 and
2), PAP-β and PAP-γ and variants thereof.

In a further embodiment, the dephosphorylation of
substrate occurs in vitro, by contacting a substrate with
recombinantly produced human PAP expressed by the methods
described above. The dephosphorylated substrate is then
isolated by standard isolation and purification methods,
including, for example, thin layer chromatography or high
pressure liquid chromatography.

In another embodiment, the dephosphorylation of
substrate occurs in vivo via the administration of human
PAP to a mammal, preferably a human. "Administration"
means delivery of human PAP protein to a mammal by
methods known to those of skill in the art including, but
not limited to: orally, for example in the form of
pills, tablets, lacquer tablets, coated tablets,
granules, hard gelatin capsules, soft gelatin capsules,
solutions, syrups, emulsions, suspensions or aerosol
mixtures; rectally, for example in the form of
suppositories; parenterally, for example in the form of
injection solutions or infusion solutions, microcapsules
or rods; percutaneously, for example in the form of
ointments or tinctures; transdermally; intravascularly;
intracavitary; intramuscularly; subcutaneously; and
nasally, for example in the form of nasal sprays or
inhalants.

The administration of human PAP protein includes the
administration of the protein combined in a mixture with
a pharmaceutically acceptable carrier vehicle. Suitable
vehicles and their formulation, inclusive of other human
proteins, e.g. human serum albumin, are described for
example in Remington's Pharmaceutical Sciences by E.W.
Martin, which is hereby incorporated by reference. Such
compositions will contain an effective amount of protein
hereof together with a suitable amount of vehicle in
order to prepare pharmaceutically acceptable compositions
suitable for effective administration to the host.

Such compositions should be stable for appropriate
periods of time, preferably are acceptable for
administration to humans and preferably are readily
manufacturable. Although pharmaceutical solution
formulations are provided in liquid form appropriate for
immediate use, formulations may also be provided in
frozen or in lyophilized form. In the former case, the
composition must be thawed prior to use. The latter form
is often used to enhance the stability of the medicinal
agent contained in the composition under a wide variety
of storage conditions. Such lyophilized preparations are
reconstituted prior to use by the addition of suitable
pharmaceutically acceptable diluents, such as sterile
water or sterile physiological saline solution.

Additionally, administration is meant to include
delivery of human PAP protein to a mammal by means of
gene therapy techniques, i.e., by the delivery of
polynucleotides encoding human PAP to PAP-deficient
cells, whereby human PAP is then expressed in the cell.
Gene therapy techniques are known to those of skill in
the art. For example, a listing of present-day vectors
suitable for use in gene therapy of the present invention
is set forth in Hodgson, Bio/Technology 13: 222 (1995).
See also Culver et al., Science 256:1550-62 (1992).

Additionally, liposome-mediated gene transfer is
another suitable method for the introduction of a
recombinant vector containing a polynucleotide encoding
human PAP into a PAP-deficient cell. See Caplen et al.,
Nature Med. 1:39-46 (1995) and Zhu et al., Science
261:209-211 (1993).

Additionally, viral vector-mediated gene transfer is
also a suitable method for the introduction of a
recombinant vector containing the gene encoding human PAP
into a PAP-deficient cell. Examples of appropriate viral
vectors are adenovirus vectors. Detailed discussions of
the use of adenoviral vectors for gene therapy can be
found in Berkner, BioTechniques 6:616-629 (1988), and
Trapnell, Advanced Drug Delivery Rev. 12:185-199 (1993).

The following examples merely illustrate the
invention and, as such, are not to be considered as
limiting the invention set forth in the claims.

Example 1 

Cloning and Expression of Human PAP-a, PAP-fl an d PAP-7 

Homology search of the GenBank database (Boguski, 
et al., Science 265:1993-1994, 1994) of expressed 
sequence tags (dbEST) using the murine PAP protein 
sequence (Kai et al., J. Biol. Chem. 271: 18931-18938, 
1996) as probe identified several short stretches of 
human cDNA sequences with homology to the murine PAP 
protein sequence. These cDNA sequences of interest were 
derived from single-run partial sequencing of random 
human cDNA cloning projects carried out mainly by the 
I.M.A.G.E. Consortium [LLNL] cDNA clones program. Based 
on the partial DNA sequences available in the GenBank 
database, the human cDNA clones that are homologous to 
the murine PAP protein sequence can be grouped into three 
classes, suggesting the presence of at least three 
different human PAP variants, designated as PAP-α, 
PAP-β, and PAP-γ here. For instance, a potential human 
PAP-α clone (GenBank #H17855) identified contains sequence 
homologous to aa 272-283 and the 3'-untranslated region 
of murine PAP; a potential human PAP-β clone (GenBank 
#W70040) identified contains sequence similarities 
corresponding to aa 175-251 of murine PAP; and a 
potential human PAP-γ clone (GenBank #N75714) identified 
contains sequence similarities corresponding to aa 
18-142 of murine PAP. These cDNA clones were purchased 
(Genome Systems, St. Louis, MO) for further analysis. 
DNA sequence determination of the entire cDNA inserts of 
these clones showed that clone H17855 contained sequences 
that are homologous to the N- and C-terminal sequences of 
murine PAP with a gap of about 150 bp that led to a 
shift in reading frame. This clone is most likely a 
spuriously spliced form of a PAP-α clone. Clone W70040 was 
found to be a full-length PAP-β clone, and clone N75714 
was found to be a partial PAP-γ clone with an open 
reading frame homologous to the region from aa 18 to the 
C-terminus of murine PAP. 
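
The residue ranges quoted above can be laid side by side to see which parts of the murine protein the three ESTs did and did not match. The following is a minimal sketch only: the clone names and ranges are taken from the text, while the total length of 283 residues is an assumption (it is simply the highest residue number quoted), and the helper name `uncovered_ranges` is hypothetical.

```python
# Residue ranges of murine PAP matched by each human EST, as quoted in the text.
EST_HITS = {
    "H17855 (PAP-alpha)": (272, 283),
    "W70040 (PAP-beta)": (175, 251),
    "N75714 (PAP-gamma)": (18, 142),
}

# Assumption: the murine protein is taken to end at residue 283, the highest
# residue number quoted in the text.
MURINE_PAP_LENGTH = 283


def uncovered_ranges(hits, length):
    """Return the 1-based inclusive residue ranges matched by no EST."""
    gaps = []
    next_free = 1
    for start, end in sorted(hits.values()):
        if start > next_free:
            gaps.append((next_free, start - 1))
        next_free = max(next_free, end + 1)
    if next_free <= length:
        gaps.append((next_free, length))
    return gaps


if __name__ == "__main__":
    for name, (start, end) in EST_HITS.items():
        print(f"{name}: aa {start}-{end} ({end - start + 1} residues)")
    print("Not matched by any EST:", uncovered_ranges(EST_HITS, MURINE_PAP_LENGTH))
```

The ranges belong to three different human variants, so the gaps only describe what the database search returned; they say nothing about the full-length clones obtained afterwards.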

To assemble a full-length functional PAP-α clone, 
synthetic oligonucleotides o_papalF, 5'-ggcatggtAC 
CATGTTTGAC AAGACGCGGC-3' (SEQ ID NO:18), based on the 
N-terminal region of PAP-α, and o_papalR, 5'-CATATGTAGT 
ATTCAATGTA ACC-3' (SEQ ID NO:19), based on a region 
downstream of a Pst I site complementary to the coding 
strand of PAP-α, were used to amplify the N-terminal 
coding region of PAP-α from a human lung cDNA library 
(Life Technologies, Inc., Gaithersburg, MD). The 450 bp 
Acc65 I - Pst I fragment generated was inserted into an 
Acc65 I / Pst I vector from pBluescript (II) SK(-) 
(Stratagene, San Diego, CA) for further analysis. DNA 
sequence analysis of the subclones obtained revealed at 
least two different classes of clones with sequences that 
diverged at the putative exon of interest, suggesting the 
presence of two alternatively spliced forms of PAP-α. 
These two alternatively spliced forms of PAP-α are 
designated as PAP-α1 and PAP-α2 here. Each of the 
individual 450 bp Acc65 I - Pst I fragments generated by 
PCR was combined with the 810 bp Pst I - Not I fragment 
derived from clone H17855 for ligation into an Acc65 I / 
Not I mammalian expression vector derived from pCE2 for 
the generation of expression plasmids for PAP-α1 and 
PAP-α2. The plasmid pCE2 was derived from pREP7b (Leung, et 
al., Proc. Natl. Acad. Sci. USA, 92: 4813-4817, 1995) 
with the RSV promoter region replaced by the CMV enhancer 
and the elongation factor-1α (EF-1α) promoter and intron. 
The CMV enhancer of the pCE2 vector was constructed from 
a 380 bp Xba I-Sph I fragment produced by PCR from pCEP4 
(Invitrogen, San Diego, CA) using the primers 5'- 
GGCTCTAGAT ATTAATAGTA ATCAATTAC-3' (SEQ ID NO:14) and 5'- 
CCTCACGCAT GCACCATGGT AATAGC-3' (SEQ ID NO:15). The 
EF-1α promoter and intron (Uetsuki, et al., J. Biol. Chem., 
264: 5791-5798, 1989) were constructed from a 1200 bp Sph 
I-Asp718 I fragment produced by PCR from human genomic 
DNA using the primers 5'-GGTGCATGCG TGAGGCTCCG GTGC-3' 
(SEQ ID NO:16) and 5'-GTAGTTTTCA CGGTACCTGA AATGGAAG-3' 
(SEQ ID NO:17). These two fragments were ligated into a 
Xba I/Asp718 I digested vector derived from pREP7b to 
generate pCE2. 

The DNA sequence determined from clone N75714 was 
used as a probe to search for clones with overlapping 
sequences in the GenBank database. Clone Z43618 was 
found to contain an additional 5'-sequence with a 
potential ATG initiation codon. To assemble a full-length 
PAP-γ clone, synthetic oligonucleotides o_papglF, 
5'-tgatggctag cATGCAGAGA AGATGGGTCT TCGTGCTGCT CGACGTG-3' 
(SEQ ID NO:20), based on the N-terminal region of PAP-γ, 
and o_papglR, 5'-AGTGCGGGAT CCCATAAGTG GTTG-3' (SEQ ID 
NO:21), based on a region complementary to the coding 
strand of PAP-γ just downstream of its stop codon, were 
used to generate the full-length coding region of PAP-γ 
by PCR using the clone N75714 as template. The 820 bp 
Nhe I - BamH I fragment obtained was then ligated into a 
Nhe I / BamH I mammalian expression vector derived from 
pCE2. 
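
Each cloning step above relies on a restriction site carried by a PCR primer. As a rough check, the sketch below searches the primers quoted in this Example for the recognition sequence of the enzyme used with each one. The primer sequences are copied from the text with the spacing removed; the recognition sequences are the standard ones for these enzymes and do not come from this document; the pairing of primer to enzyme is an inference from the fragment names in the text; and the function name `site_offset` is hypothetical.

```python
# Standard recognition sequences for the enzymes named in the text.
# Acc65 I and Asp718 I both recognize the same sequence as Kpn I.
SITES = {
    "Acc65 I": "GGTACC",
    "Asp718 I": "GGTACC",
    "Xba I": "TCTAGA",
    "Sph I": "GCATGC",
    "Nhe I": "GCTAGC",
    "BamH I": "GGATCC",
}

# Primers as quoted in the text, paired with the enzyme whose site each is
# expected to carry (the pairing is inferred from the fragment names).
PRIMERS = {
    "SEQ ID NO:14": ("GGCTCTAGATATTAATAGTAATCAATTAC", "Xba I"),
    "SEQ ID NO:15": ("CCTCACGCATGCACCATGGTAATAGC", "Sph I"),
    "SEQ ID NO:16": ("GGTGCATGCGTGAGGCTCCGGTGC", "Sph I"),
    "SEQ ID NO:17": ("GTAGTTTTCACGGTACCTGAAATGGAAG", "Asp718 I"),
    "SEQ ID NO:18": ("ggcatggtACCATGTTTGACAAGACGCGGC", "Acc65 I"),
    "SEQ ID NO:20": ("tgatggctagcATGCAGAGAAGATGGGTCTTCGTGCTGCTCGACGTG", "Nhe I"),
    "SEQ ID NO:21": ("AGTGCGGGATCCCATAAGTGGTTG", "BamH I"),
}


def site_offset(primer, enzyme):
    """Return the 0-based offset of the enzyme's site in the primer, or -1."""
    return primer.upper().find(SITES[enzyme])


if __name__ == "__main__":
    for name, (primer, enzyme) in PRIMERS.items():
        offset = site_offset(primer, enzyme)
        status = f"found at offset {offset}" if offset >= 0 else "not found"
        print(f"{name}: {enzyme} site {status}")
```

All of the sites listed are palindromic, so the search does not need to consider the reverse complement of a primer.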

Figures 1, 2, 3 and 4 show the translated DNA 
sequences of the putative human cDNA clones for PAP-α1, 
α2, β and γ (SEQ ID NOS: 1, 3, 5 and 7), respectively. 
The designated ATG initiation site for translation of 
each cDNA clone fulfills the requirement for an adequate 
initiation site according to Kozak (Kozak, Critical Rev. 
Biochem. Mol. Biol. 27:385-402, 1992). 
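
One way to make the "adequate initiation site" statement concrete is to look at the two positions usually taken as most important in the Kozak context: a purine at -3 and a G at +4 (the A of ATG being +1). The sketch below applies that reading to the bases flanking each designated ATG as printed in the sequence listing. The three-way labels and the function name `classify_start` are illustrative assumptions, not terms defined in this document.

```python
# Seven-base windows around each designated start codon, read from the
# sequence listing: positions -3..-1, the ATG itself, and position +4.
START_CONTEXTS = {
    "PAP-alpha1 (SEQ ID NO:1)": "ACCATGT",
    "PAP-alpha2 (SEQ ID NO:3)": "ACCATGT",
    "PAP-beta (SEQ ID NO:5)": "GCCATGC",
    "PAP-gamma (SEQ ID NO:7)": "ACCATGC",
}


def classify_start(window):
    """Classify a 7-base window (-3..+4) by the two key Kozak positions."""
    window = window.upper()
    if len(window) != 7 or window[3:6] != "ATG":
        raise ValueError("expected three bases, ATG, then one base")
    purine_at_minus3 = window[0] in "AG"
    g_at_plus4 = window[6] == "G"
    if purine_at_minus3 and g_at_plus4:
        return "strong"
    if purine_at_minus3 or g_at_plus4:
        return "adequate"
    return "weak"


if __name__ == "__main__":
    for clone, window in START_CONTEXTS.items():
        print(f"{clone}: {window} -> {classify_start(window)}")
```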

The amino acid sequence of each open reading frame 
(Figures 1, 2, 3 and 4 (SEQ ID NOS:2, 4, 6 and 8)) was 
used as the query sequence to search for homologous 
sequences in protein databases. Search of the GenBank 
database from the National Center for Biotechnology 
Information (NCBI) using the blastp program showed that 
these proteins are most homologous to the murine PAP 
sequence (Kai et al., J. Biol. Chem. 271: 18931-18938, 
1996), and a rat endoplasmic reticulum resident 
transmembrane protein of unknown function, Dri 42, whose 
expression is up-regulated during epithelial 
differentiation (Barila et al., J. Biol. Chem. 271: 
29928-29936, 1996). 

Example 2 

Activation of PAP-β Transcription by IL-1β 

It is possible that activation of PAP-β expression 
can counterbalance the inflammatory response from IL-1β 
stimulation through degradation of the excess amount of 
PA in cells. To determine whether IL-1β, an inflammatory 
cytokine, would activate the transcription of PAP mRNAs, 
Northern analysis of PAP-β mRNA levels (Fig. 6) was 
performed in human endothelial ECV304 cells at various 
times after IL-1β stimulation. Figure 6 shows that PAP-β 
mRNA expression was induced after incubation of ECV304 
cells with IL-1β for at least 6 hours, suggesting that 
PAP-β is a late-response gene to IL-1β stimulation. This 
indicates that human PAP may act to reduce IL-1β induced 
inflammation by degrading excess PA in cells. 

Example 3 

PAP-α1 and PAP-α2 Dephosphorylation of PA to DAG 

The expression of PAP-α1 and PAP-α2 cDNA was found 
to increase PA dephosphorylation in mammalian cells. 
The expression plasmids for PAP-α1, PAP-α2 and the 
control vector were transiently transfected into 293-EBNA 
(EB293) cells (Invitrogen, San Diego, CA) using the 
lipofectant DOTAP (Boehringer Mannheim, Indianapolis, 
IN). PAP activities were followed by TLC analysis based 
on the conversion of [C14]PA (DuPont NEN, Boston, MA) to 
[C14]DAG using membrane fractions isolated from the 
various cell extracts. Figure 7 shows that membrane 
fractions derived from cells transfected with either 
PAP-α1 (lanes 6 and 7) or PAP-α2 (lanes 8 and 9) produced 
more [C14]DAG than those from untransfected cells (lanes 
2 and 3) or from cells transfected with the control pCE2 
vector (lanes 4 and 5). In this particular chromatography 
system, DAG can be resolved into two bands, possibly due 
to heterogeneity in the acyl chains. It appears that 
PAP-α1 and PAP-α2 preferentially dephosphorylate 
different species of PA, as evidenced by the change in 
relative intensity of the two DAG bands (lanes 6 to 9). 




The possibility that PAP-α expression can degrade the 
excess amount of PA in cells suggests that PAP-α may be 
down-regulated in tumor cells when compared to normal 
cells, as tumor cells tend to be more inflammatory due to 
a possibly higher level of PA when compared to normal or 
resting cells. To test this hypothesis, Northern 
analysis using a PAP-α (1 and 2) cDNA probe was performed 
on RNA blots derived from various matching pairs of tumor 
and normal tissues (Invitrogen, Carlsbad, CA). Figure 8 
shows that the expression levels of PAP-α mRNA are 
substantially higher in five out of eight of the normal 
tissues examined, namely colon, rectal, breast, 
fallopian tube, and ovarian tissues, when compared to the 
corresponding tumor tissues. 






WO 98/46730 



PCT/US98/07928 






SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

     (i) APPLICANT: LEUNG, David W. 
                    TOMPKINS, Christopher K 

    (ii) TITLE OF INVENTION: HUMAN PHOSPHATIDIC ACID PHOSPHATASE 

   (iii) NUMBER OF SEQUENCES: 21 

    (iv) CORRESPONDENCE ADDRESS: 
          (A) ADDRESSEE: Foley & Lardner 
          (B) STREET: 3000 K Street, N.W., Suite 500 
          (C) CITY: Washington 
          (D) STATE: D.C. 
          (E) COUNTRY: USA 
          (F) ZIP: 20007-5109 

     (v) COMPUTER READABLE FORM: 
          (A) MEDIUM TYPE: Floppy disk 
          (B) COMPUTER: IBM PC compatible 
          (C) OPERATING SYSTEM: PC-DOS/MS-DOS 
          (D) SOFTWARE: PatentIn Release #1.0, Version #1.30 

    (vi) CURRENT APPLICATION DATA: 
          (A) APPLICATION NUMBER: US 08/842,827 
          (B) FILING DATE: 17-APR-1997 
          (C) CLASSIFICATION: 

  (viii) ATTORNEY/AGENT INFORMATION: 
          (A) NAME: BENT, Stephen A. 
          (B) REGISTRATION NUMBER: 29,768 
          (C) REFERENCE/DOCKET NUMBER: 77319/125 

    (ix) TELECOMMUNICATION INFORMATION: 
          (A) TELEPHONE: (202) 672-5300 
          (B) TELEFAX: (202) 672-5399 
          (C) TELEX: 904136 



(2) INFORMATION FOR SEQ ID NO:1: 

     (i) SEQUENCE CHARACTERISTICS: 
          (A) LENGTH: 1563 base pairs 
          (B) TYPE: nucleic acid 
          (C) STRANDEDNESS: single 
          (D) TOPOLOGY: linear 

    (ix) FEATURE: 
          (A) NAME/KEY: CDS 
          (B) LOCATION: 342..1193 

    (ix) FEATURE: 
          (A) NAME/KEY: mat_peptide 
          (B) LOCATION: 342..1193 

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 

CCTGTGGGAG AGAGCGCCGG GATCCGGACG GGGTAGCAAC CGGGGCAGGC CGTGCCGGCT 60 
GAGGAGGTCC TGAGGCTACA GAGCTGCCGC GGCTGGCACA CGAGCGCCTC GGCACTAACC 120 
GAGTGTTCGC GGGGGCTGTG AGGGGAGGGC CCCGGGCGCC ATTGCTGGCG GTGGGAGCGC 180 
CGCCCGGTCT CAGCCCGCCC TCGGCTGCTC TCCTCCTCCG GCTGGGAGGG GCCGTATCTC 240 
GGGGCCGTCG CCAGCCCCGG CCCGGGCTCG ATAATCAAGG GCCTCGGCCG TCGTCCCGCA 300 
rrrrnTTCCA TCGCCCTTGC CGGGCAGCCC GGGCAGAGAC C ATG TTT GAC AAG 353 
                                              Met Phe Asp Lys 
                                              1 

ACG CGG CTG CCG TAC GTG GCC CTC GAT GTG CTC TGC GTG TTG CTG GCT 401 
Thr Arg Leu Pro Tyr Val Ala Leu Asp Val Leu Cys Val Leu Leu Ala 
5                   10                  15                  20 

GGA TTG CCT TTT GCA ATT CTT ACT TCA AGG CAT ACC CCC TTC CAA CGA 449 
Gly Leu Pro Phe Ala Ile Leu Thr Ser Arg His Thr Pro Phe Gln Arg 
                25                  30                  35 

CGA GTA TTC TGT AAT GAT GAG TCC ATC AAG TAC CCT TAC AAA GAA GAC 497 
Gly Val Phe Cys Asn Asp Glu Ser Ile Lys Tyr Pro Tyr Lys Glu Asp 
            40                  45                  50 

ACC ATA CCT TAT GCG TTA TTA GGT GGA ATA ATC ATT CCA TTC AGT ATT 545 
Thr Ile Pro Tyr Ala Leu Leu Gly Gly Ile Ile Ile Pro Phe Ser Ile 
        55                  60                  65 

ATC GTT ATT ATT CTT GGA GAA ACC CTG TCT GTT TAC TGT AAC CTT TTG 593 
Ile Val Ile Ile Leu Gly Glu Thr Leu Ser Val Tyr Cys Asn Leu Leu 
    70                  75                  80 

nr TCA AAT TCC TTT ATC AGG AAT AAC TAC ATA GCC ACT ATT TAC AAA 641 
His Ser Asn Ser Phe Ile Arg Asn Asn Tyr Ile Ala Thr Ile Tyr Lys 
85                  90                  95                  100 

GCC ATT GGA ACC TTT TTA TTT GGT GCA GCT GCT AGT CAG TCC CTG ACT 689 
Ala Ile Gly Thr Phe Leu Phe Gly Ala Ala Ala Ser Gln Ser Leu Thr 
                105                 110                 115 

GAC ATT GCC AAG TAT TCA ATA GGC AGA CTG CGG CCT CAC TTC TTG GAT 737 
Asp Ile Ala Lys Tyr Ser Ile Gly Arg Leu Arg Pro His Phe Leu Asp 
            120                 125                 130 

GTT TGT GAT CCA GAT TGG TCA AAA ATC AAC TGC AGC GAT GGT TAC ATT 785 
Val Cys Asp Pro Asp Trp Ser Lys Ile Asn Cys Ser Asp Gly Tyr Ile 
        135                 140                 145 

rlA TAC TAC ATA TGT CGA GGG AAT GCA GAA AGA GTT AAG GAA GGC AGG 833 
Glu Tyr Tyr Ile Cys Arg Gly Asn Ala Glu Arg Val Lys Glu Gly Arg 
    150                 155                 160 

TTG TCC TTC TAT TCA GGC CAC TCT TCG TTT TCC ATG TAC TGC ATG CTG 881 
Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Ser Met Tyr Cys Met Leu 
165                 170                 175                 180 

TTT GTG GCA CTT TAT CTT CAA GCC AGG ATG AAG GGA GAC TGG GCA AGA 929 
Phe Val Ala Leu Tyr Leu Gln Ala Arg Met Lys Gly Asp Trp Ala Arg 
                185                 190                 195 

CTC TTA CGC CCC ACA CTG CAA TTT GGT CTT GTT GCC GTA TCC ATT TAT 977 
Leu Leu Arg Pro Thr Leu Gln Phe Gly Leu Val Ala Val Ser Ile Tyr 
            200                 205                 210 

CTG GGC CTT TCT CGA GTT TCT GAT TAT AAA CAC CAC TGG AGC GAT GTG 1025 
Val Gly Leu Ser Arg Val Ser Asp Tyr Lys His His Trp Ser Asp Val 
        215                 220                 225 

TTG ACT GGA CTC ATT CAG GGA GCT CTG GTT GCA ATA TTA GTT GCT GTA 1073 
Leu Thr Gly Leu Ile Gln Gly Ala Leu Val Ala Ile Leu Val Ala Val 
    230                 235                 240 

TAT GTA TCG GAT TTC TTC AAA GAA AGA ACT TCT TTT AAA GAA AGA AAA 1121 
Tyr Val Ser Asp Phe Phe Lys Glu Arg Thr Ser Phe Lys Glu Arg Lys 
245                 250                 255                 260 

GAG GAG GAC TCT CAT ACA ACT CTG CAT GAA ACA CCA ACA ACT GGG AAT 1169 
Glu Glu Asp Ser His Thr Thr Leu His Glu Thr Pro Thr Thr Gly Asn 
                265                 270                 275 

CAC TAT CCG AGC AAT CAC CAG CCT TGAAAGGCAG CAGGGTGCCC AGGTGAAGCT 1223 
His Tyr Pro Ser Asn His Gln Pro 
            280 

GGCCTGTTTT CTAAAGGAAA ATGATTGCCA CAAGGCAAGA GGATGCATCT TTCTTCCTGG 1283 
TGTACAAGCC TTTAAAGACT TCTGCTGCTG ATATGCCTCT TGGATGCACA CTTTGTGTGT 1343 
ACATAGTTAC CTTTAACTCA GTGGTTATCT AATAGCTCTA AACTCATTAA AAAAACTCCA 1403 
AGCCTTCCAC CAAAACAGTG CCCCACCTGT ATACATTTTT ATTAAAAAAA TGTAATGCTT 1463 
ATGTATAAAC ATGTATGTAA TATGCTTTCT ATGAATGATG TTTGATTTAA ATATAATACA 1523 
TATTAAAATG TATGGGAGAA CCAAAAAAAA AAAAAAAAAA 1563 
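
The pairing of codons with three-letter residue names in the listing above can be spot-checked by translating a short stretch. The sketch below is illustrative only: it translates the first four and last nine codons of the coding region as printed in SEQ ID NO:1, using a deliberately partial codon table that covers just the codons in these two stretches (standard genetic code), and the function name `translate` is hypothetical.

```python
# Partial codon table (standard genetic code) covering only the codons
# that occur in the two stretches translated below.
CODON_TABLE = {
    "ATG": "Met", "TTT": "Phe", "GAC": "Asp", "AAG": "Lys",
    "CAC": "His", "TAT": "Tyr", "CCG": "Pro", "AGC": "Ser",
    "AAT": "Asn", "CAG": "Gln", "CCT": "Pro", "TGA": "Stop",
}

# Codons copied from SEQ ID NO:1: the start of the coding region and its end,
# including the TGA that follows the last residue.
CDS_START = "ATG TTT GAC AAG"
CDS_END = "CAC TAT CCG AGC AAT CAC CAG CCT TGA"


def translate(codons):
    """Translate space-separated codons, stopping at the first stop codon."""
    residues = []
    for codon in codons.split():
        residue = CODON_TABLE[codon.upper()]
        if residue == "Stop":
            break
        residues.append(residue)
    return residues


if __name__ == "__main__":
    print("Start:", " ".join(translate(CDS_START)))
    print("End:  ", " ".join(translate(CDS_END)))
```

A complete translation would need the full 64-codon table; the partial table here raises a `KeyError` on any codon outside the two stretches.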

(2) INFORMATION FOR SEQ ID NO:2: 

     (i) SEQUENCE CHARACTERISTICS: 
          (A) LENGTH: 284 amino acids 
          (B) TYPE: amino acid 
          (D) TOPOLOGY: linear 

    (ii) MOLECULE TYPE: protein 

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Met Phe Asp Lys Thr Arg Leu Pro Tyr Val Ala Leu Asp Val Leu Cys 
1               5                   10                  15 

Val Leu Leu Ala Gly Leu Pro Phe Ala Ile Leu Thr Ser Arg His Thr 
            20                  25                  30 

Pro Phe Gln Arg Gly Val Phe Cys Asn Asp Glu Ser Ile Lys Tyr Pro 
        35                  40                  45 

Tyr Lys Glu Asp Thr Ile Pro Tyr Ala Leu Leu Gly Gly Ile Ile Ile 
    50                  55                  60 

Pro Phe Ser Ile Ile Val Ile Ile Leu Gly Glu Thr Leu Ser Val Tyr 
65                  70                  75                  80 

Cys Asn Leu Leu His Ser Asn Ser Phe Ile Arg Asn Asn Tyr Ile Ala 
                85                  90                  95 

Thr Ile Tyr Lys Ala Ile Gly Thr Phe Leu Phe Gly Ala Ala Ala Ser 
            100                 105                 110 

Gln Ser Leu Thr Asp Ile Ala Lys Tyr Ser Ile Gly Arg Leu Arg Pro 
        115                 120                 125 

His Phe Leu Asp Val Cys Asp Pro Asp Trp Ser Lys Ile Asn Cys Ser 
    130                 135                 140 

Asp Gly Tyr Ile Glu Tyr Tyr Ile Cys Arg Gly Asn Ala Glu Arg Val 
145                 150                 155                 160 

Lys Glu Gly Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Ser Met 
                165                 170                 175 

Tyr Cys Met Leu Phe Val Ala Leu Tyr Leu Gln Ala Arg Met Lys Gly 
            180                 185                 190 

Asp Trp Ala Arg Leu Leu Arg Pro Thr Leu Gln Phe Gly Leu Val Ala 
        195                 200                 205 

Val Ser Ile Tyr Val Gly Leu Ser Arg Val Ser Asp Tyr Lys His His 
    210                 215                 220 

Trp Ser Asp Val Leu Thr Gly Leu Ile Gln Gly Ala Leu Val Ala Ile 
225                 230                 235                 240 

Leu Val Ala Val Tyr Val Ser Asp Phe Phe Lys Glu Arg Thr Ser Phe 
                245                 250                 255 

Lys Glu Arg Lys Glu Glu Asp Ser His Thr Thr Leu His Glu Thr Pro 
            260                 265                 270 

Thr Thr Gly Asn His Tyr Pro Ser Asn His Gln Pro 
        275                 280 
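
The CDS coordinates given for each nucleotide entry should agree with the length stated for the corresponding protein entry, since an inclusive range start..end spans end - start + 1 bases and each residue takes three. The sketch below runs that arithmetic for three of the entries, using the coordinates and lengths as printed in the listing (SEQ ID NOS: 1/2, 3/4 and 5/6); the function name `codons_in_cds` is hypothetical.

```python
# (CDS start, CDS end, stated protein length) as printed in the listing.
ENTRIES = {
    "SEQ ID NO:1 -> SEQ ID NO:2": (342, 1193, 284),
    "SEQ ID NO:3 -> SEQ ID NO:4": (342, 1196, 285),
    "SEQ ID NO:5 -> SEQ ID NO:6": (294, 1226, 311),
}


def codons_in_cds(start, end):
    """Return (whole codons, leftover bases) for an inclusive CDS range."""
    length = end - start + 1
    return divmod(length, 3)


if __name__ == "__main__":
    for label, (start, end, residues) in ENTRIES.items():
        codons, leftover = codons_in_cds(start, end)
        agrees = codons == residues and leftover == 0
        print(f"{label}: {codons} codons, {leftover} leftover bases, "
              f"stated {residues} residues, agrees={agrees}")
```

In these entries the stop codon lies just outside the stated range, which is why the codon count equals the residue count exactly.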

(2) INFORMATION FOR SEQ ID NO:3: 

     (i) SEQUENCE CHARACTERISTICS: 
          (A) LENGTH: 1566 base pairs 
          (B) TYPE: nucleic acid 
          (C) STRANDEDNESS: single 
          (D) TOPOLOGY: linear 

    (ix) FEATURE: 
          (A) NAME/KEY: CDS 
          (B) LOCATION: 342..1196 

    (ix) FEATURE: 
          (A) NAME/KEY: mat_peptide 
          (B) LOCATION: 342..1196 



GGGTAGCAAC 


CGGGGCAGGC 


CGTGCCGGCT 


60 


GGCTGGCACA 


CGAGCGCCTC 


GGCACTAACC 


120 


CCCGGGCGCC 


ATTGCTGGCG 


GTGGGAGCGC 


180 


TCCTCCTCCG 


GCTGGGAGGG 


GCCGTATCTC 


240 


ATAATCAAGG 


GCCTCGGCCG 


TCGTCCCGCA 


300 


GGGCAGAGAC 


C ATG TTT 
Met Phe 
1 


GAC AAG 
Asp Lys 


353 



B S SS SS S S3 S 2 £ S 2 £ S3 S S S 



401 



5 10 
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AAA GCC ATT GGA ACC TTT TTA TTT GGT GCA GCT GCT AGT CAG TCC CTG 
£vs Ala lie Gly Thr Phe Leu Phe Gly Ala Ala Ala Ser Gin Ser Leu 
y 105 HP 115 

ACT GAC ATT GCC AAG TAT TCA ATA GGC AGA CTG CGG CCT CAC TTC TTG 
S Sp lie Ala Lys Tyr Ser lie Gly Arg Leu Arg Pro His Phe Leu 

120 125 

GAT GTT TG T GAT CCA GAT TGG TCA AAA ATC AAC TGC AGC GAT GGT TAC 
Asp ?" Cys Asp Pro Asp Trp Ser Lys lie Asn Cys Ser Asp Gly Tyr 
135 140 14= 

ATT GAA TAC TAC ATA TGT CGA GGG AAT GCA GAA AGA GTT AAG GAA GGC 
S GlS Tyr Tyr lie Cys Arg Gly Asn Ala Glu Arg Val Lys Glu Gly 
150 155 160 

TTr TCC ttc TAT TCA GGC CAC TCT TCG TTT TCC ATG TAC TGC ATG 
Arg Leu Ser Pne T?r Set Gly His Ser Ser Phe Ser Met Tyr Cys Met 
165 170 l" 5 

rTr GTG GCA cxt T AT CTT CAA GCC AGG ATG AAG GGA GAC TGG GCA 

2S S 3S S Su Tyr Leu Gin Ala Arg Met Lys Gly Asp Trp Ala 

185 I 90 195 



... rTr TT » CGC CCC ACA CTG CAA TTT GGT CTT GTT GCC GTA TCC ATT 
5£ Leu Leu Arg Pro !5 Leu Gin Phe Gly Leu Val Ala Val Ser He 
y 200 205 210 

TAT GTG GGC CTT TCT CGA GTT TCT GAT TAT AAA CAC CAC TGG AGC GAT 
T?r S2 Gly Leu Ser Arg Val Ser Asp Tyr Lys His His Trp Ser Asp 
215 220 225 

rTG TTG ACT GGA CTC ATT CAG GGA GCT CTG GTT GCA ATA TTA GTT GCT 
5a? Leu tS Gly Leu He Gin Gly Ala Leu Val Ala lie Leu Val Ala 
230 235 240 

GTA TAT GTA TCG GAT TTC TTC AAA GAA AGA ACT TCT TTT AAA GAA AGA 
?S ™ ?2 Ser Asp Phe Phe Lys Glu Arg Thr Ser Phe Lys Glu Arg 
245 250 255 

AAA GAG GAG GAC TCT CAT ACA ACT CTG CAT GAA ACA CCA ACA ACT GGG 
£S Glu Glu Asp Ser His Thr Thr Leu His Glu Thr Pro Thr Thr Gly 

265 270 

AAT CAC TAT CCG AGC AAT CAC CAG CCT TGAAAGGCAG CAGGGTGCCC 
Asn His Tyr Pro Ser Asn His Gin Pro 

280 285 



449 



497 



TCC ATG CCT ATG GCT GTT CTA AAA TTG GGC CAA ATA TAT CCA TTT CAG 
Ser Sit Pro Met Ala Val Leu Lys Leu Gly Gin lie Tyr Pro Phe Gin 

25 30 35 

AGA GGC TTT TTC TGT AAA GAC AAC AGC ATC AAC TAT CCG TAC CAT GAC 
S Sy Phe Cys Lys Asp Asn Ser lie Asn Tyr Pro Tyr His Asp 
^ X 40 45 50 

ACT ACC GCC GCA TCC ACT GTC CTC ATC CTA GTG GGG GTT GGC TTG CCC 545 
22 rlr III Ala Ser Thr Val Leu lie Leu Val Gly Val Gly Leu Pro 
55 60 65 

CTT TCC TCT ATT ATT CTT GGA GAA ACC CTG TCT GTT TAC TGT AAC CTT 
Sal Ser Ser lie lie Leu Gly Glu Thr Leu Ser Val Tyr Cys Asn Leu 
70 7 5 80 

TTC CAC TCA AAT TCC TTT ATC AGT AAT AAC TAC ATA GCC ACT ATT TAC 
2s Ser isn Ser Phe He Ser Asn Asn Tyr lie Ala Thr lie Tyr 
85 90 95 100 



593 



641 



689 



737 



785 



833 



881 



929 



977 



1025 



1073 



1121 



1169 



1216 
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A.GGTGAAGCT GGCCTGTTTT CTAAAGGAAA ATGATTGCCA CAAGGCAAGA GGATGCATCT 
TTCTTCCTGG TGTACAAGCC TTTAAAGACT TCTGCTGCTG ATATGCCTCT TGGATGCACA 
CTTTGTGTGT ACATAGTTAC CTTTAACTCA GTGGTTATCT AATAGCTCTA AACTCATTAA 
AAAAACTCCA AGCCTTCCAC CAAAACAGTG CCCCACCTGT ATACATTTTT ATTAAAAAAA 
TGTAATGCTT ATGTATAAAC ATGTATGTAA TATGCTTTCT ATGAATGATG TTTGATTTAA 
ATATAATACA TATTAAAATG TATGGGAGAA CCAAAAAAAA AAAAAAAAAA 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met Phe Asp Lys Thr Arg Leu Pro Tyr V.1 Ala Leu Asp Val Leu Cys 

1 5 

Val Leu Leu Ala Ser Met Pro Met Ala Val Leu Lys Leu Gly Gin lie 

20 25 
Tyr p ro Phe Gin Arg Gly Phe Phe Cys Lys Asp Asn Ser lie Asn Tyr 

Pro Tyr His Asp Ser Thr Ala Ala Ser Thr Val Leu lie Leu Val Gly 
50 55 

xr-i cat* ser He He Leu Gly Glu Thr Leu Ser Val 
Val Gly Leu Pro Val Ser ser j-xc u« 80 

65 70 

Tyr cys Asn Leu Leu His Ser Asn Ser Phe He Ser Asn Asn Tyr lie 

Ala Thr lie Tyr Lys Ala lie Gly Thr Phe Leu Phe Gly Ala Ala Ala 

100 105 
ser Gin Ser Leu Thr Asp He Ala Lys Tyr Ser lie Gly Arg Leu Arg 

115 120 
Pro His Phe Leu Asp Val Cys Asp Pro Asp Trp Ser Lys He Asn cys 

130 135 

ser Asp Gly Tyr He Glu Tyr Tyr He cys Arg Gly Asn Ala Glu Arg 

145 150 

Val Lys Glu Gly Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Ser 

165 17U 
Met Tyr Cys Met Leu Phe Val Ala Leu Tyr Leu Gin Ala Arg Met Lys 



180 



Gly Asp Trp Ala Arg Leu Leu Arg Pro Thr Leu Gin Phe Gly Leu Val 



195 200 



Ma Val Ser He Tyr Val Gly Leu Ser Arg Val Ser Asp Tyr Lys His 



1276 

1336 

1396 

1456 

1516 

1566 



215 220 
210 Zlb 
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His Trp Ser Asp Val Leu Thr Glv Leu lie Gin Gly Ala Leu Val Ala 
225 230 235 240 

He Leu Val Ala Val Tyr Val Ser Asp Phe Phe Lys Glu Arg Thr Ser 

245 * 250 255 

Phe Lys Glu Arg Lys Glu Glu Aso Ser His Thr Thr Leu His Glu Thr 

260 " 265 270 

Pro Thr Thr Gly Asn His Tyr Pre Ser Asn His Gin Pro 
275 280 285 

(2) INFORMATION FOR SEQ ID NO:5: 

     (i) SEQUENCE CHARACTERISTICS: 
          (A) LENGTH: 1362 base pairs 
          (B) TYPE: nucleic acid 
          (C) STRANDEDNESS: single 
          (D) TOPOLOGY: linear 

    (ix) FEATURE: 
          (A) NAME/KEY: CDS 
          (B) LOCATION: 294..1226 

    (ix) FEATURE: 
          (A) NAME/KEY: mat_peptide 
          (B) LOCATION: 294..1226 

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

GGCGCAGCTC TGCAAAAGTT TCTGCTCGGG ATCTGGCTCT CTTCCCCTTG GACTTTAGAA 60 

CGATTTAGGG TTGACAGAGG AAAGCAGAGG CGCGCAGGAG GAGCAGAAAA CACCACCTTC 120 

TGCAGTTGGA GGCAGGCAGC CCCGGCTGCA CTCTAGCCGC CGCGCCCGGA GCCGGGGCCG 180 

ACCCGCCACT ATCCGCAGCA GCCTCGGCCA GGAGGCGACC CGGGCGCCTG GGTGTGTGGC 240 

TGCTGTTGCG GGACGTCTTC GCGGGGCGGG AGGCTCGCGC CGCAGCCAGC GCC ATG 296 

Met 

1 

CAA AAC TAC AAG TAC GAC AAA GCG ATC GTe CCG GAG AGC AAG AAC GGC 344 
Gin Asn Tyr Lys Tyr Asp Lys Ala He Val Pro Glu Ser Lys Asn Gly 

5 10 15 

GGC AGC CCG GCG CTC AAC AAC AAC CCG AGG AGG AGC GGC AGC AAG CGG 3 92 

Gly Ser Pro Ala Leu Asn Asn Asn Pro Arg Arg Ser Gly Ser Lys Arg 
20 25 30 

GTG CTG CTC ATC TGC CTC GAC CTC TTC TGC CTC TTC ATG GCG GGC CTC 440 
Val Leu Leu He Cys Leu Asp Leu Phe Cys Leu Phe Met Ala Gly Leu 
35 40 45 

CCC TTC CTC ATC ATC GAG ACA AGC ACC ATC AAG CCT TAC CAC CGA GGG 488 
Pro Phe Leu He He Glu Thr Ser Thr He Lys Pro Tyr His Arg Gly 
50 55 60 65 

TTT TAC TGC AAT GAT GAG AGC ATC AAG TAC CCA CTG AAA ACT GGT GAG 536 

Phe Tyr Cys Asn Asp Glu Ser He Lys Tyr Pro Leu Lys Thr Gly Glu 

70 75 80 
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_ nnrv rTC rrr TGT GCC GTG GGG ATC GTC ATT GCC ATC 

g g £ £ S S3 £ g S v.! ay n. v.i ii. «. iu 

85 90 
M ~ nrr rGG GAA TTC TAC CGG ATC TAT TAC CTG AAG AAG 

g S S S g 8? SI S ivr »■ n. *v £* 0 "» 

100 105 



^ « T ,r v n*r zvap CCC TAC GTG GCA GCA CTC TAT AAG CAA 

g g S g S £ £ SS £ »! »U Al, » Tyr «. CI- 



115 120 



„ TrT nrr atC AGC CAG TCT TTC ACA GAC 

GTG GGC TGC TTC CTC TTT GGC TGT GCC ATC AGC CJ ^ ^ ^ ^ 

Val Gly Cys Phe Leu Phe Gly Cys A±a lie ^ ^ 



130 135 



_ ... rTG TCC ATA QGG CGC CTG CGT CCT CAC TTC TTG AGT GTC 

S A?! vS Ser SI Gly Arg Leu Arg Pro His Phe Leu Ser Val 

150 lb:3 



- nrr rar rtc AAC TGC TCT GAA GGC TAC ATT CAG 

£ SS S £ 5 g 2 S £ T cv. ser clu «y Ty, «U al„ 

16 5 170 



«^ nnn r^T PAT GAC AGC AAA GTC CAG GAA GCC AGG AAG 

£ £ £ £ g 5 £ £ 52 «. »i «. ss «- «. - 

180 1Bb 



^ r 6 T rrr TCC TTC TCC ATG TAC ACT ATG CTG TAT 
E S S sfr fly £5 a" E E E Met Tyr Thr Met Leu Tyr 



195 200 



„ rrr rRr ttc ACT TGG CGA GGA GCC CGC CTG 

2 S S £ S SS S £ £ «. gj « «r ». - 



210 215 



CTC CCC CCC CTC CTG C f TTC ACC TTC ATC ATC ATC J=C TTC TAC ACC 

Leu Ars Pro Leu Leu Gin £>ne inr ^ 

230 235 

rr , ™ TCT CGC GTA TCA GAC CAC AAG CAC CAT CCC AGT GAT GTT CTG 

GGA CTu TCT CGC biA His His prQ ger Asp Val Leu 

Gly Leu Ser Arg Val ser *sp 25& 

245 250 



S S S S £ £ S 2 £ S £ g S S S 5E 

260 265 



S g £ S g g S £ - S S | S g S g 



275 280 



584 



632 



^ r» ?\ Tv atp PTT TCA CCT GTG GAC ATT ATT GAC AGG AAC 

S g g g Si g g S S5 v.! Asp H. II- »P AT, Ass 

290 295 

AAT CAC CAC AAC ATG ATG TAGGTGCCAC CCACCTCCTG AGCTGTTTTT 

Asn His His Asn Met Met 

310 

GTAAAA7GAC TGCTGACAGC AAGTTCTTGC TGCTCTCCAA TCTCATCAGA CAGTAGAATG 
TAGGGAAAAA CTTTTGCCCG ACTGATTTTT AAAAAAAAAA AAAAAA 



680 



728 



776 



824 



872 



920 



968 



1016 



1064 



1112 



1160 



1208 



1256 



1316 
1362 
(2) INFORMATION FOR SEQ ID NO:6: 

     (i) SEQUENCE CHARACTERISTICS: 
          (A) LENGTH: 311 amino acids 
          (B) TYPE: amino acid 
          (D) TOPOLOGY: linear 

    (ii) MOLECULE TYPE: protein 

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

Met Gln Asn Tyr Lys Tyr Asp Lys Ala Ile Val Pro Glu Ser Lys Asn 
1               5                   10                  15 

Gly Gly Ser Pro Ala Leu Asn Asn Asn Pro Arg Arg Ser Gly Ser Lys 
            20                  25                  30 

Arg Val Leu Leu Ile Cys Leu Asp Leu Phe Cys Leu Phe Met Ala Gly 
        35                  40                  45 

Leu Pro Phe Leu Ile Ile Glu Thr Ser Thr Ile Lys Pro Tyr His Arg 
    50                  55                  60 

Gly Phe Tyr Cys Asn Asp Glu Ser Ile Lys Tyr Pro Leu Lys Thr Gly 
65                  70                  75                  80 

Glu Thr Ile Asn Asp Ala Val Leu Cys Ala Val Gly Ile Val Ile Ala 
                85                  90                  95 

Ile Leu Ala Ile Ile Thr Gly Glu Phe Tyr Arg Ile Tyr Tyr Leu Lys 
            100                 105                 110 

Lys Ser Arg Ser Thr Ile Gln Asn Pro Tyr Val Ala Ala Leu Tyr Lys 
        115                 120                 125 

Gln Val Gly Cys Phe Leu Phe Gly Cys Ala Ile Ser Gln Ser Phe Thr 
    130                 135                 140 

Asp Ile Ala Lys Val Ser Ile Gly Arg Leu Arg Pro His Phe Leu Ser 
145                 150                 155                 160 

Val Cys Asn Pro Asp Phe Ser Gln Ile Asn Cys Ser Glu Gly Tyr Ile 
                165                 170                 175 

Gln Asn Tyr Arg Cys Arg Gly Asp Asp Ser Lys Val Gln Glu Ala Arg 
            180                 185                 190 

Lys Ser Phe Phe Ser Gly His Ala Ser Phe Ser Met Tyr Thr Met Leu 
        195                 200                 205 

Tyr Leu Val Leu Tyr Leu Gln Ala Arg Phe Thr Trp Arg Gly Ala Arg 
    210                 215                 220 

Leu Leu Arg Pro Leu Leu Gln Phe Thr Leu Ile Met Met Ala Phe Tyr 
225                 230                 235                 240 

Thr Gly Leu Ser Arg Val Ser Asp His Lys His His Pro Ser Asp Val 
                245                 250                 255 

Leu Ala Gly Phe Ala Gln Gly Ala Leu Val Ala Cys Cys Ile Val Phe 
            260                 265                 270 

Phe Val Ser Asp Leu Phe Lys Thr Lys Thr Thr Leu Ser Leu Pro Ala 
        275                 280                 285 

Pro Ala Ile Arg Lys Glu Ile Leu Ser Pro Val Asp Ile Ile Asp Arg 
    290                 295                 300 

Asn Asn His His Asn Met Met 
305                 310 



(2) INFORMATION FOR SEQ ID NO:7: 

     (i) SEQUENCE CHARACTERISTICS: 
          (A) LENGTH: 1232 base pairs 
          (B) TYPE: nucleic acid 
          (C) STRANDEDNESS: single 
          (D) TOPOLOGY: linear 

    (ix) FEATURE: 
          (A) NAME/KEY: CDS 
          (B) LOCATION: 4..833 

    (ix) FEATURE: 
          (A) NAME/KEY: mat_peptide 
          (B) LOCATION: 4..833 

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

ACC ATG CAG CGG AGG TGG GTC TTC GTG CTG CTC GAC GTG CTG TGC TTA 
Met Gin Arg Arg Trp Val Phe Val Leu Leu Asp Val Leu Cys Leu 
x 5 10 1S 

r-rr GTC GCC TCC CTG CCC TTC GCT ATC CTG ACG CTG GTG AAC GCC CCG 
HI SS Ala Ser Leu Pro Phe Ala lie Leu Thr Leu Val Asn Ala Pro 



25 30 
TAC AAG CGA GGA TTT TAC TGC GGG GAT GAC TCC ATC CGG TAC CCC TAC 

Arg Gly Phe Tyr Cys Gly Asp Asp Ser lie Arg Tyr Pro Tyr 
35 40 45 

rrr CCA GAT ACC ATC ACC CAC GGG CTC ATG GCT GGG GTC ACC ATC ACG 
££ Sro Asp Thr lie Thr His Gly Leu Met Ala Gly Val Thr He Thr 

rrr ACC GTC ATC CTT GTC TCG GCC GGG GAA GCC TAC CTG GTG TAC ACA 
25 Thr Va5 lie Leu Val Ser Ala Gly Glu Ala Tyr Leu Val Tyr Thr 
65 ™ 75 

GAC CGG CTC TAT TCT CGC TCG GAC TTC AAC AAC TAC GTG GCT GCT GTA 
Asp Arg Leu Tyr Ser Arg Ser Asp Phe Asa Asn Tyr Val Ala Ala Val 

80 85 90 

TAC AAG GTG CTG GGG ACC TTC CTG TTT GGG GCT GCC GTG AGC CAG TCT 

Gly 

100 

CTG ACA GAC CTG GCC AAG TAC ATG ATT GGG CGT CTG AAG CCC AAC TTC 

Leu 
115 

CTA GCC GTC TGC GAC CCC GAC TGG AGC CGG GTC AAC TGC TCG GTC TAT 
2u Ala Val Cys Asp Pro Asp Trp Ser Arg Val Asn Cys Ser Val Tyr 
130 13 5 140 

GTG CAG CTG GAG AAG GTG TGC AGG GGA AAC CCT GCT GAT GTC ACC GAG 
Val Gin Leu Glu Lys Val Cys Arg Gly Asn Pro Ala Asp Val Thr Glu 
145 150 155 

GCC AGG TTG TCT TTC TAC TCG GGA CAC TCT TCC TTT GGG ATG TAC TGC 
Sa Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Gly Met Tyr Cys 
160 1« "0 175 



TAC AAG GTG CTG GGG ACC TTC CTU TTT v*>«* «v-v- ™>- w~ --- 

™r £ys Val Leu Gly Thr Phe Leu Phe Gly Ala Ala Val Ser Gin Ser 

"* 105 

CTG ACA GAC CTG GCC AAG TAC ATG ATT GGG CGT CTG AAG CCC AAC 
Su tS Asp Leu Ala Lys Tyr Met lie Gly Arg Leu Lys Pro Asn Phe 

120 125 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 
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s ss s s s s s a s s = ss s £ s 

180 

^r/^ nan TTP -TC CTG GTG GCC TTT GCC 

GCA CGG CTG CTG CGA CCC ACA GTC CAG TTC TTC CTG ^ ^ ^ 

Ala Arg Leu Leu Arg Pro inr ^ 205 

2 ss s ss s s a = 5 s » s = s s 

210 Zi 



_ „„_ _ r GGG GCA CTG GTG GCT GCC CTC ACT 

GAT GTC CTT GTT GGC CTC CTG CAG GGG ^ ^ ^ ^ ^ Leu Thr 

Asp Val Leu Val Gly i>eu ubu 



225 230 



„ ~« n*r TTC TTC AAA GCC CGA CCC CCA CAG CAC TGT 

GT< T l GC £r ill sS Asp Phe Pne L^ Ala Arg Pro Pro Gin His Cys 
Val Cys Tyr lie ber 250 



245 

240 



ctg « « «■ «> 25 Si S $ S S 2 S S S 3 

Leu Lys Glu Glu Glu Leu Glu Arg uys ^ 27Q 
ACC CTG GGG CGA GGC TG ACCACAACCA CTTATG-GGAT ACCCGCACTC 



Thr Leu Gly Arg Gly 

275 



TOTTCCIOA GGCCGGACCG CGCCCAGGCA GGGAGCTGCT GTGAGTCCAG CTGATGCCCA 
CCCAGGTGGT CCGTCCAGCC TGGTTAGGCA CTGAGGGTTC TGGACGGGCT CCAGGAACCC 
TGGGCTGATG GGAGCAGTGA GCGGTTCCGC TGCCCCCTGC CCTGCACTGG ACGAGGAGTC 
TGGAGATGCC TGGGTAGCCC TCAGCATTTG GAGGGGAACC TGTTCCCGTC GGTCCCCAAA 
TATCCCCTTC TTTITATGGG GTTAAGGAAS GGACCGAGAG ATCAGATAGT TGCTGTTTTG 
TAAAATGTAA TGTATATGTG GTTTTTAGTA AAATAGGGCA CCTGTTTCAC AAAAAAAAAA 



AAAAAAAAA 



(2) INFORMATION FOR SEQ ID NO: 8: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
Me t Gin Arg Arg Trp Val Phe Val Leu Leu Asp Val Leu Cys Leu Leu 

Val Ala Ser Leu Pro Phe Ala lie Leu Thr Leu Val Asn Ala Pro Tyr 

20 " 

Lys Axg Gly V CV= "V A=P Asp s« Ar, Tyr Pre Tyr « 
Pro M p X IL T*r Hi. Gly M »t Al. Gly val Tnr n. * «. 

50 



576 



624 



672 



720 



768 



816 



863 



923 
983 
1043 
1103 
1163 
1223 
1232 
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Thr Val He Leu Val Ser Ala Gly Glu Ala Tyr Leu Val Tyr Thr Asp 
65 70 75 80 

Arg Leu Tyr Ser Arg Ser Asp Phe Asn Asn Tyr Val Ala Ala Val Tyr 

85 90 95 

Lvs Val Leu Gly Thr Phe Leu Phe Gly Ala Ala Val Ser Gin Ser Leu 
* 100 105 HO 

Thr Asp Leu Ala Lys Tyr Met lie Gly Arg Leu Lys Pro Asn Phe Leu 
* U5 120 125 

Ala Val Cys Asp Pro Asp Trp Ser Arg Val Asn Cys Ser Val Tyr Val 
130 135 140 

Gin Leu Glu Lys Val Cys Arg Gly Asn Pro Ala Asp Val Thr Glu Ala 
145 150 155 160 

Arcr Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Gly Met Tyr Cys Met 
y 165 170 175 

Val Phe Leu Ala Leu Tyr Val Gin Ala Arg Leu Cys Trp Lys Trp Ala 

180 185 190 

Arq Leu Leu Arg Pro Thr Val Gin Phe Phe Leu Val Ala Phe Ala Leu 
* 195 ~ 200 205 

Tvr Val Gly Tyr Thr Arg Val Ser Asp Tyr Lys His His Trp Ser Asp 
210 215 220 

Val Leu Val Gly Leu Leu Gin Gly Ala Leu Val Ala Ala Leu Thr Val 
225 230 235 240 

Cvs Tyr He Ser Asp Phe Phe Lys Ala Arg Pro Pro Gin His Cys Leu 

245 250 255 

Lvs Glu Glu Glu Leu Glu Arg Lys Pro Ser Leu Ser Leu Thr Leu Thr 
Y 260 265 270 

Leu Gly Arg Gly 
275 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 283 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

Met Phe Asp Lys Thr Arg Leu Pro Tyr Val Ala Leu Asp Val Ile Cys
1 5 10 15

Val Leu Leu Ala Gly Leu Pro Phe Ala Ile Leu Thr Ser Arg His Thr
20 25 30

Pro Phe Gln Arg Gly Ile Phe Cys Asn Asp Asp Ser Ile Lys Tyr Pro
35 40 45

Tyr Lys Glu Asp Thr Ile Pro Tyr Ala Leu Leu Gly Gly Ile Val Ile
50 55 60

Pro Phe Cys Ile Ile Val Met Ser Ile Gly Glu Ser Leu Ser Val Tyr
65 70 75 80

Phe Asn Val Leu His Ser Asn Ser Phe Val Gly Asn Pro Tyr Ile Ala
85 90 95

Thr Ile Tyr Lys Ala Val Gly Ala Phe Leu Phe Gly Val Ser Ala Ser
100 105 110

Gln Ser Leu Thr Asp Ile Ala Lys Tyr Thr Ile Gly Ser Leu Arg Pro
115 120 125

His Phe Leu Ala Ile Cys Asn Pro Asp Trp Ser Lys Ile Asn Cys Ser
130 135 140

Asp Gly Tyr Ile Glu Asp Tyr Ile Cys Gln Gly Asn Glu Glu Lys Val
145 150 155 160

Lys Glu Gly Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Ser Met
165 170 175

Tyr Cys Met Leu Phe Val Ala Leu Tyr Leu Gln Ala Arg Met Lys Gly
180 185 190

Asp Trp Ala Arg Leu Leu Arg Pro Met Leu Gln Phe Gly Leu Ile Ala
195 200 205

Phe Ser Ile Tyr Val Gly Leu Ser Arg Val Ser Asp Tyr Lys His His
210 215 220

Trp Ser Asp Val Thr Val Gly Leu Ile Gln Gly Ala Ala Met Ala Ile
225 230 235 240

Leu Val Ala Leu Tyr Val Ser Asp Phe Phe Lys Asp Thr His Ser Tyr
245 250 255

Lys Glu Arg Lys Glu Glu Asp Pro His Thr Thr Leu His Glu Thr Ala
260 265 270

Ser Ser Arg Asn Tyr Ser Thr Asn His Glu Pro
275 280

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 284 amino acids

(B) TYPE: amino acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

Met Phe Asp Lys Thr Arg Leu Pro Tyr Val Ala Leu Asp Val Leu Cys
1 5 10 15

Val Leu Leu Ala Gly Leu Pro Phe Ala Ile Leu Thr Ser Arg His Thr
20 25 30

Pro Phe Gln Arg Gly Val Phe Cys Asn Asp Glu Ser Ile Lys Tyr Pro
35 40 45

Tyr Lys Glu Asp Thr Ile Pro Tyr Ala Leu Leu Gly Gly Ile Ile Ile
50 55 60

Pro Phe Ser Ile Ile Val Ile Ile Leu Gly Glu Thr Leu Ser Val Tyr
65 70 75 80

Cys Asn Leu Leu His Ser Asn Ser Phe Ile Arg Asn Asn Tyr Ile Ala
85 90 95

Thr Ile Tyr Lys Ala Ile Gly Thr Phe Leu Phe Gly Ala Ala Ala Ser
100 105 110

Gln Ser Leu Thr Asp Ile Ala Lys Tyr Ser Ile Gly Arg Leu Arg Pro
115 120 125

His Phe Leu Asp Val Cys Asp Pro Asp Trp Ser Lys Ile Asn Cys Ser
130 135 140

Asp Gly Tyr Ile Glu Tyr Tyr Ile Cys Arg Gly Asn Ala Glu Arg Val
145 150 155 160

Lys Glu Gly Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Ser Met
165 170 175

Tyr Cys Met Leu Phe Val Ala Leu Tyr Leu Gln Ala Arg Met Lys Gly
180 185 190

Asp Trp Ala Arg Leu Leu Arg Pro Thr Leu Gln Phe Gly Leu Val Ala
195 200 205

Val Ser Ile Tyr Val Gly Leu Ser Arg Val Ser Asp Tyr Lys His His
210 215 220

Trp Ser Asp Val Leu Thr Gly Leu Ile Gln Gly Ala Leu Val Ala Ile
225 230 235 240

Leu Val Ala Val Tyr Val Ser Asp Phe Phe Lys Glu Arg Thr Ser Phe
245 250 255

Lys Glu Arg Lys Glu Glu Asp Ser His Thr Thr Leu His Glu Thr Pro
260 265 270

Thr Thr Gly Asn His Tyr Pro Ser Asn His Gln Pro
275 280



(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 285 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

Met Phe Asp Lys Thr Arg Leu Pro Tyr Val Ala Leu Asp Val Leu Cys
1 5 10 15

Val Leu Leu Ala Ser Met Pro Met Ala Val Leu Lys Leu Gly Gln Ile
20 25 30

Tyr Pro Phe Gln Arg Gly Phe Phe Cys Lys Asp Asn Ser Ile Asn Tyr
35 40 45

Pro Tyr His Asp Ser Thr Ala Ala Ser Thr Val Leu Ile Leu Val Gly
50 55 60

Val Gly Leu Pro Val Ser Ser Ile Ile Leu Gly Glu Thr Leu Ser Val
65 70 75 80

Tyr Cys Asn Leu Leu His Ser Asn Ser Phe Ile Arg Asn Asn Tyr Ile
85 90 95

Ala Thr Ile Tyr Lys Ala Ile Gly Thr Phe Leu Phe Gly Ala Ala Ala
100 105 110

Ser Gln Ser Leu Thr Asp Ile Ala Lys Tyr Ser Ile Gly Arg Leu Arg
115 120 125

Pro His Phe Leu Asp Val Cys Asp Pro Asp Trp Ser Lys Ile Asn Cys
130 135 140

Ser Asp Gly Tyr Ile Glu Tyr Tyr Ile Cys Arg Gly Asn Ala Glu Arg
145 150 155 160

Val Lys Glu Gly Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Ser
165 170 175

Met Tyr Cys Met Leu Phe Val Ala Leu Tyr Leu Gln Ala Arg Met Lys
180 185 190

Gly Asp Trp Ala Arg Leu Leu Arg Pro Thr Leu Gln Phe Gly Leu Val
195 200 205

Ala Val Ser Ile Tyr Val Gly Leu Ser Arg Val Ser Asp Tyr Lys His
210 215 220

His Trp Ser Asp Val Leu Thr Gly Leu Ile Gln Gly Ala Leu Val Ala
225 230 235 240

Ile Leu Val Ala Val Tyr Val Ser Asp Phe Phe Lys Glu Arg Thr Ser
245 250 255

Phe Lys Glu Arg Lys Glu Glu Asp Ser His Thr Thr Leu His Glu Thr
260 265 270

Pro Thr Thr Gly Asn His Tyr Pro Ser Asn His Gln Pro
275 280 285



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Met Gln Asn Tyr Lys Tyr Asp Lys Ala Ile Val Pro Glu Ser Lys Asn
1 5 10 15

Gly Gly Ser Pro Ala Leu Asn Asn Asn Pro Arg Arg Ser Gly Ser Lys
20 25 30

Arg Val Leu Leu Ile Cys Leu Asp Leu Phe Cys Leu Phe Met Ala Gly
35 40 45

Leu Pro Phe Leu Ile Ile Glu Thr Ser Thr Ile Lys Pro Tyr His Arg
50 55 60

Gly Phe Tyr Cys Asn Asp Glu Ser Ile Lys Tyr Pro Leu Lys Thr Gly
65 70 75 80

Glu Thr Ile Asn Asp Ala Val Leu Cys Ala Val Gly Ile Val Ile Ala
85 90 95

Ile Leu Ala Ile Ile Thr Gly Glu Phe Tyr Arg Ile Tyr Tyr Leu Lys
100 105 110

Lys Ser Arg Ser Thr Ile Gln Asn Pro Tyr Val Ala Ala Leu Tyr Lys
115 120 125

Gln Val Gly Cys Phe Leu Phe Gly Cys Ala Ile Ser Gln Ser Phe Thr
130 135 140

Asp Ile Ala Lys Val Ser Ile Gly Arg Leu Arg Pro His Phe Leu Ser
145 150 155 160

Val Cys Asn Pro Asp Phe Ser Gln Ile Asn Cys Ser Glu Gly Tyr Ile
165 170 175

Gln Asn Tyr Arg Cys Arg Gly Asp Asp Ser Lys Val Gln Glu Ala Arg
180 185 190

Lys Ser Phe Phe Ser Gly His Ala Ser Phe Ser Met Tyr Thr Met Leu
195 200 205

Tyr Leu Val Leu Tyr Leu Gln Ala Arg Phe Thr Trp Arg Gly Ala Arg
210 215 220

Leu Leu Arg Pro Leu Leu Gln Phe Thr Leu Ile Met Met Ala Phe Tyr
225 230 235 240

Thr Gly Leu Ser Arg Val Ser Asp His Lys His His Pro Ser Asp Val
245 250 255

Leu Ala Gly Phe Ala Gln Gly Ala Leu Val Ala Cys Cys Ile Val Phe
260 265 270

Phe Val Ser Asp Leu Phe Lys Thr Lys Thr Thr Leu Ser Leu Pro Ala
275 280 285

Pro Ala Ile Arg Lys Glu Ile Leu Ser Pro Val Asp Ile Ile Asp Arg
290 295 300

Asn Asn His His Asn Met Met
305 310

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 276 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

Met Gln Arg Arg Trp Val Phe Val Leu Leu Asp Val Leu Cys Leu Leu
1 5 10 15

Val Ala Ser Leu Pro Phe Ala Ile Leu Thr Leu Val Asn Ala Pro Tyr
20 25 30

Lys Arg Gly Phe Tyr Cys Gly Asp Asp Ser Ile Arg Tyr Pro Tyr Arg
35 40 45

Pro Asp Thr Ile Thr His Gly Leu Met Ala Gly Val Thr Ile Thr Ala
50 55 60

Thr Val Ile Leu Val Ser Ala Gly Glu Ala Tyr Leu Val Tyr Thr Asp
65 70 75 80

Arg Leu Tyr Ser Arg Ser Asp Phe Asn Asn Tyr Val Ala Ala Val Tyr
85 90 95

Lys Val Leu Gly Thr Phe Leu Phe Gly Ala Ala Val Ser Gln Ser Leu
100 105 110

Thr Asp Leu Ala Lys Tyr Met Ile Gly Arg Leu Lys Pro Asn Phe Leu
115 120 125

Ala Val Cys Asp Pro Asp Trp Ser Arg Val Asn Cys Ser Val Tyr Val
130 135 140

Gln Leu Glu Lys Val Cys Arg Gly Asn Pro Ala Asp Val Thr Glu Ala
145 150 155 160

Arg Leu Ser Phe Tyr Ser Gly His Ser Ser Phe Gly Met Tyr Cys Met
165 170 175

Val Phe Leu Ala Leu Tyr Val Gln Ala Arg Leu Cys Trp Lys Trp Ala
180 185 190

Arg Leu Leu Arg Pro Thr Val Gln Phe Phe Leu Val Ala Phe Ala Leu
195 200 205

Tyr Val Gly Tyr Thr Arg Val Ser Asp Tyr Lys His His Trp Ser Asp
210 215 220

Val Leu Val Gly Leu Leu Gln Gly Ala Leu Val Ala Ala Leu Thr Val
225 230 235 240

Cys Tyr Ile Ser Asp Phe Phe Lys Ala Arg Pro Pro Gln His Cys Leu
245 250 255

Lys Glu Glu Glu Leu Glu Arg Lys Pro Ser Leu Ser Leu Thr Leu Thr
260 265 270

Leu Gly Arg Gly 
275 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GGCTCTAGAT ATTAATAGTA ATCAATTAC 
(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:
CCTCACGCAT GCACCATGGT AATAGC 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GGTGCATGCG TGAGGCTCCG GTGC 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:
GTAGTTTTCA CGGTACCTGA AATGGAAG 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:
GGCATGGTAC CATGTTTGAC AAGACGCGGC 
(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:
CATATGTAGT ATTCAATGTA ACC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 47 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:
TGATGGCTAG CATGCAGAGA AGATGGGTCT TCGTGCTGCT CGACGTG 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:
AGTGCGGGAT CCCATAAGTG GTTG
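
The declared LENGTH fields of the oligonucleotide entries (SEQ ID NOS: 14 to 21) can be cross-checked against the listed sequences. The following is a minimal Python sketch of such a check and is not part of the original disclosure; the names `PRIMERS` and `length_mismatches` are illustrative assumptions, and the sequences are copied from the listing above with their ten-base spacing retained.

```python
# Oligonucleotide entries transcribed from SEQ ID NOS: 14-21 of the listing.
# Each value is (declared LENGTH in base pairs, sequence as printed, spaces kept).
PRIMERS = {
    14: (29, "GGCTCTAGAT ATTAATAGTA ATCAATTAC"),
    15: (26, "CCTCACGCAT GCACCATGGT AATAGC"),
    16: (24, "GGTGCATGCG TGAGGCTCCG GTGC"),
    17: (28, "GTAGTTTTCA CGGTACCTGA AATGGAAG"),
    18: (30, "GGCATGGTAC CATGTTTGAC AAGACGCGGC"),
    19: (23, "CATATGTAGT ATTCAATGTA ACC"),
    20: (47, "TGATGGCTAG CATGCAGAGA AGATGGGTCT TCGTGCTGCT CGACGTG"),
    21: (24, "AGTGCGGGAT CCCATAAGTG GTTG"),
}


def length_mismatches(primers):
    """Return SEQ ID numbers whose sequence is inconsistent with its header.

    An entry is flagged if its length (ignoring spaces) differs from the
    declared LENGTH, or if it contains a character other than A, C, G or T.
    """
    flagged = []
    for seq_id, (declared, printed) in primers.items():
        seq = printed.replace(" ", "")  # drop the ten-base grouping spaces
        if len(seq) != declared or set(seq) - set("ACGT"):
            flagged.append(seq_id)
    return flagged


print(length_mismatches(PRIMERS))
```

An empty list indicates that every entry agrees with its declared length.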



What Is Claimed Is:

1. An isolated polynucleotide encoding human
phosphatidic acid phosphatase wherein said polynucleotide
encodes a protein comprising a polypeptide sequence
selected from the group consisting of (i) the sequence at
amino acid number 1 to amino acid number 284 in Figure 1
(SEQ ID NO:2), (ii) the sequence at amino acid number 1
to amino acid number 285 in Figure 2 (SEQ ID NO:4), and
(iii) the sequence at amino acid number 1 to amino acid
number 276 in Figure 4 (SEQ ID NO:8).

2. An isolated human phosphatidic acid phosphatase
protein, wherein said protein comprises a polypeptide
sequence selected from the group consisting of (i) the
sequence at amino acid number 1 to amino acid number 284
in Figure 1 (SEQ ID NO:2), (ii) the sequence at amino
acid number 1 to amino acid number 285 in Figure 2 (SEQ
ID NO:4), and (iii) the sequence at amino acid number 1
to amino acid number 276 in Figure 4 (SEQ ID NO:8).

3. A method of preparing a human phosphatidic acid
phosphatase protein comprising the steps of (i)
transforming a host cell with an expression vector
comprising a polynucleotide encoding human phosphatidic
acid phosphatase, (ii) culturing said transformed host
cells which express said protein and (iii) isolating said
protein.

4. The method of claim 3, wherein said
polynucleotide encoding human phosphatidic acid phosphatase is
selected from the group consisting of (i) the sequence at
amino acid number 1 to amino acid number 284 in Figure 1
(SEQ ID NO:2), (ii) the sequence at amino acid number 1
to amino acid number 285 in Figure 2 (SEQ ID NO:4), (iii)
the sequence at amino acid number 1 to amino acid number
311 in Figure 3 (SEQ ID NO:6), and (iv) the sequence at
amino acid number 1 to amino acid number 276 in Figure 4
(SEQ ID NO:8).

5. A method of dephosphorylating a substrate
comprising recombinantly producing a human phosphatidic
acid phosphatase protein and contacting said substrate
with an effective amount of said recombinantly produced
human phosphatidic acid phosphatase protein such that
said protein catalyzes the dephosphorylation of said
substrate.

6. The method of claim 5, wherein said protein
comprises the polypeptide sequence at amino acid number
1 to amino acid number 284 in Figure 1 (SEQ ID NO:2).

7. The method of claim 5, wherein said protein
comprises the polypeptide sequence at amino acid number
1 to amino acid number 285 in Figure 2 (SEQ ID NO:4).

8. The method of claim 5, wherein said protein
comprises the polypeptide sequence at amino acid number
1 to amino acid number 311 in Figure 3 (SEQ ID NO:6).

9. The method of claim 5, wherein said protein
comprises the polypeptide sequence at amino acid number
1 to amino acid number 276 in Figure 4 (SEQ ID NO:8).

10. The method of claim 5, wherein said substrate
is selected from the group consisting of phosphatidic
acid, lysophosphatidic acid, ceramide 1-phosphate, and
sphingosine 1-phosphate.

11. The method of claim 5, wherein said contacting
is effected in vitro, and further comprises the step of
isolating said dephosphorylated substrate.

12. The method of claim 5, wherein said contacting
step occurs in vivo and is effected by the administration
of said human phosphatidic acid phosphatase to a mammal
in need thereof.

13. A method of dephosphorylating a substrate
comprising contacting said substrate with an effective
amount of isolated human phosphatidic acid phosphatase
protein such that said protein catalyzes the
dephosphorylation of said substrate, wherein said
substrate is selected from the group consisting of
lysophosphatidic acid, ceramide 1-phosphate, and
sphingosine 1-phosphate.
Fig. 1A

CCTGTGGGAGAGAGCGCCGGGATCCGGACGGGGTAGOUCCGGGGCAGGCCGTGCCGGCTGA 
GGAGGTCCTGAGGCTACAGAGCTGCCGCGGCTGGCACACGAGCGCCTCGGCACTAACCGA 
GTGTTCGCGGGGGCTGTGAGGGGAGGGCCCCGGGCGCCATTGCTGGCGGTGGGAGCGCCG 
CCCGGTCTCAGCCCGCCCTCGGCTGCTCTCCTCCTCCGGCTGGGAGGGGCCGTATCTCGG 
Fig. 1B

GAG GAG GAC TCT CAT ACA ACT CTG CAT GAA ACA CCA ACA ACT GGG
Glu Glu Asp Ser His Thr Thr Leu His Glu Thr Pro Thr Thr Gly

AAT CAC TAT CCG AGC AAT CAC CAG CCT TGA AAG GCAGCAGGGTGCCCAG 
Asn His Tyr Pro Ser Asn His Gln Pro ***

280 

T^TGTGTAWTRCTTACCTTTAACTCRGTGGTTATCTAATAGCTCTAAACTCATTAAAft 
ATAATACATATTAAAATGTATGGGAGAACCAAAAAAAAAAAAAAAAAA
Fig. 2A

ggccgSc^ 

TCATTCCATCGCCCTTGCCGGGCAGCCCGGGCAGAGACC ATG TTT GAC AAG ACG 

6 5 

CGG CTG CCG TAC GTG GCC CTC GAT GTG CTC TGC GTG TTG CTG GCT 
Arg Leu Pro Tyr Val Ala Leu Asp Val Leu Cys Val Leu Leu Ala

10 15 
TCC ATG CCT ATG GCT GTT CTA AAA TTG GGC CAA ATA TAT CCA TTT 
Ser Met Pro Met Ala Val Leu Lys Leu Gly Gln Ile Tyr Pro Phe

25 30 
TAC AGA GGC TTT TTC TGT AAA GAC AAC AGC ATC AAC TAT CCG TAC 
Gln Arg Gly Phe Phe Cys Lys Asp Asn Ser Ile Asn Tyr Pro Tyr

CAT GAC AGT ACC GCC GCA TCC ACT GTC CTC ATC CTA GTG GGG GTT 
His Asp Ser Thr Ala Ala Ser Thr Val Leu Ile Leu Val Gly Val

55 60 65

rrr TTG CCC GTT TCC TCT ATT ATT CTT GGA GAA ACC CTG TCT GTT
Gly Leu Pro Val Ser Ser Ile Ile Leu Gly Glu Thr Leu Ser Val

TAC TGT AAC CTT TTG CAC TCA AAT TCC TTT ATC AGT AAT AAC TAC 
Tyr Cys Asn Leu Leu His Ser Asn Ser Phe Ile Ser Asn Asn Tyr

vra rrr ACT ATT TAC AAA GCC ATT GGA ACC TTT TTA TTT GGT GCA 
Ile Ala Thr Ile Tyr Lys Ala Ile Gly Thr Phe Leu Phe Gly Ala

100 105 
GCT GCT AGT CAG TCC CTG ACT GAC ATT GCC AAG TAT TCA ATA GGC 
Ala Ala Ser Gln Ser Leu Thr Asp Ile Ala Lys Tyr Ser Ile Gly

115 120 125

AGA CTG CGG CCT CAC TTC TTG GAT GTT TGT GAT CCA GAT TGG TCA 
Arg Leu Arg Pro His Phe Leu Asp Val Cys Asp Pro Asp Trp Ser 

130 135 140

AAA ATC AAC TGC AGC GAT GGT TAC ATT GAA TAC TAC ATA TGT CGA 
Lys Ile Asn Cys Ser Asp Gly Tyr Ile Glu Tyr Tyr Ile Cys Arg

145 150 155

GGG AAT GCA GAA AGA GTT AAG GAA GGC AGG TTG TCC TTC TAT TCA 
Gly Asn Ala Glu Arg Val Lys Glu Gly Arg Leu Ser Phe Tyr Ser 

CGC CAC TCT TCG TTT TCC ATG TAC TGC ATG CTG TTT GTG GCA CTT 
Gly His Ser Ser Phe Ser Met Tyr Cys Met Leu Phe Val Ala Leu

175 180 185

TAT CTT CAA GCC AGG ATG AAG GGA GAC TGG GCA AGA CTC TTA CGC 
Tyr Leu Gln Ala Arg Met Lys Gly Asp Trp Ala Arg Leu Leu Arg

CCC ACA CTG CAA TTT GGT CTT GTT GCC GTA TCC ATT TAT GTG GGC 
Pro Thr Leu Gln Phe Gly Leu Val Ala Val Ser Ile Tyr Val Gly

205 210 215

CTT TCT CGA GTT TCT GAT TAT AAA CAC CAC TGG AGC GAT GTG TTG 
Leu Ser Arg Val Ser Asp Tyr Lys His His Trp Ser Asp Val Leu 

220 225 230

ACT GGA CTC ATT CAG GGA GCT CTG GTT GCA ATA TTA GTT GCT GTA 
Thr Gly Leu Ile Gln Gly Ala Leu Val Ala Ile Leu Val Ala Val

235 240 
Fig. 2B

TAT GTA TCG GAT TTC TTC AAA GAA AGA ACT TCT TTT AAA GAA AGA 1121 
Tyr Val Ser Asp Phe Phe Lys Glu Arg Thr Ser Phe Lys Glu Arg 

250 255 260
AAA GAG GAG GAC TCT CAT ACA ACT CTG CAT GAA ACA CCA ACA ACT 1166
Lys Glu Glu Asp Ser His Thr Thr Leu His Glu Thr Pro Thr Thr
265 270 275

GGG AAT CAC TAT CCG AGC AAT CAC CAG CCT TGA AAGGCAGCAGGGTGCC 1215 
Gly Asn His Tyr Pro Ser Asn His Gln Pro ***

280 285 
CAGGTGAAGCTGGCCTGTTTTCTAAAGGAAAATGATTGCCACAAGGCAAGAGGATGCATC 1275

TTTCTTCCTGGTGTACAAGCCTTTAAAGACTTCTGCTGCTGATATGCCTCTTGGATGCAC 1335 
ACTTTGTGTGTACATAGTTACCTTTAACTCAGTGGTTATCTAATAGCTCTAAACTCATTA 
AAAAAACTCCAAGCCTTCCACCAAAACAGTGCCCCACCTGTATACATTTTTATTAAAAAA 
ATGTAATGCTTATGTATAAACATGTATGTAATATGCTTTCTATGAATGATGTTTGATTTA 
AATATAATACATATTAAAATGTATGGGAGAACCAAAAAAAAAAAAAAAAAA 
Fig. 3A



GGCGCAGCTCTGCAAAAGTTTCTGCTCGGGATCTGGCTCTCTTCCCCTTGGACTTTAG^CG 62
ATTTAGGGTTGACAGAGGAAAGCAGAGGCGCGCAGGAGGAGCAGAAAACACCACCTTCTG 122
rArTTGGAGGCAGGCAGCCCCGGCTGCACTCTAGCCGCCGCGCCCGGAGCCGGGGCCGAC 182
SgSaSa?SgScagcctcggccaggaggcgacccgggcgcctgggtgtgtggctg 242
CTGTTGCGGGACGTCTTCGCGGGGCGGGAGGCTCGCGCCGCAGCCAGCGCC ATG CAA 299
Met Gln

AAC TAC AAG TAC GAC AAA GCG ATC GTC CCG GAG AGC AAG AAC GGC 344
Asn Tyr Lys Tyr Asp Lys Ala Ile Val Pro Glu Ser Lys Asn Gly

5 10 15 

GGC AGC CCG GCG CTC AAC AAC AAC CCG AGG AGG AGC GGC AGC AAG
Gly Ser Pro Ala Leu Asn Asn Asn Pro Arg Arg Ser Gly Ser Lys 

CGG GTG CTG CTC ATC TGC CTC GAC CTC TTC TGC CTC TTC ATG GCG 
Arg Val Leu Leu Ile Cys Leu Asp Leu Phe Cys Leu Phe Met Ala

35 40 45

rrr CTC CCC TTC CTC ATC ATC GAG ACA AGC ACC ATC AAG CCT TAC 
Gly Leu Pro Phe Leu Ile Ile Glu Thr Ser Thr Ile Lys Pro Tyr

50 55 60 

CAC CGA GGG TTT TAC TGC AAT GAT GAG AGC ATC AAG TAC CCA CTG 
His Arg Gly Phe Tyr Cys Asn Asp Glu Ser Ile Lys Tyr Pro Leu

65 70 75 

AAA ACT GGT GAG ACA ATA AAT GAC GCT GTG CTC TGT GCC GTG GGG 
Lys Thr Gly Glu Thr Ile Asn Asp Ala Val Leu Cys Ala Val Gly

80 85 90 

ATC GTC ATT GCC ATC CTC GCG ATC ATC ACG GGG GAA TTC TAC CGG 
Ile Val Ile Ala Ile Leu Ala Ile Ile Thr Gly Glu Phe Tyr Arg

95 100 105 

ATC TAT TAC CTG AAG AAG TCG CGG TCG ACG ATT CAG AAC CCC TAC 
Ile Tyr Tyr Leu Lys Lys Ser Arg Ser Thr Ile Gln Asn Pro Tyr

GTG GCA GCA CTC TAT AAG CAA GTG GGC TGC TTC CTC TTT GGC TGT 
Val Ala Ala Leu Tyr Lys Gln Val Gly Cys Phe Leu Phe Gly Cys

125 130 135

GCC ATC AGC CAG TCT TTC ACA GAC ATT GCC AAA GTG TCC ATA GGG 
Ala Ile Ser Gln Ser Phe Thr Asp Ile Ala Lys Val Ser Ile Gly

140 145 150
CGC CTG CGT CCT CAC TTC TTG AGT GTC TGC AAC CCT GAT TTC AGC 
Arg Leu Arg Pro His Phe Leu Ser Val Cys Asn Pro Asp Phe Ser 

155 160 165

CAG ATC AAC TGC TCT GAA GGC TAC ATT CAG AAC TAC AGA TGC AGA 
Gln Ile Asn Cys Ser Glu Gly Tyr Ile Gln Asn Tyr Arg Cys Arg

GGT GAT GAC AGC AAA GTC CAG GAA GCC AGG AAG TCC TTC TTC TCT 
Gly Asp Asp Ser Lys Val Gln Glu Ala Arg Lys Ser Phe Phe Ser

185 190 195

GGC CAT GCC TCC TTC TCC ATG TAC ACT ATG CTG TAT TTG GTG CTA 
Gly His Ala Ser Phe Ser Met Tyr Thr Met Leu Tyr Leu Val Leu 

200 205 210 

TAC CTG CAG GCC CGC TTC ACT TGG CGA GGA GCC CGC CTG CTC CGG 
Tyr Leu Gln Ala Arg Phe Thr Trp Arg Gly Ala Arg Leu Leu Arg

215 220 225 

CCC CTC CTG CAG TTC ACC TTG ATC ATG ATG GCC TTC TAC ACG GGA 
Pro Leu Leu Gln Phe Thr Leu Ile Met Met Ala Phe Tyr Thr Gly

230 235 240 

CTG TCT CGC GTA TCA GAC CAC AAG CAC CAT CCC AGT GAT GTT CTG 
Leu Ser Arg Val Ser Asp His Lys His His Pro Ser Asp Val Leu 
245 250 255 
Fig. 3B

Fig. 4A

ACC ATG CAG CGG AGG TGG GTC TTC GTG CTG CTC GAC GTG CTG TGC 47 
Met Gln Arg Arg Trp Val Phe Val Leu Leu Asp Val Leu Cys

5 10 
TTA CTG GTC GCC TCC CTG CCC TTC GCT ATC CTG ACG CTG GTG AAC 92 
Leu Leu Val Ala Ser Leu Pro Phe Ala Ile Leu Thr Leu Val Asn

15 20 25 

GCC CCG TAC AAG CGA GGA TTT TAC TGC GGG GAT GAC TCC ATC CGG 137 
Ala Pro Tyr Lys Arg Gly Phe Tyr Cys Gly Asp Asp Ser Ile Arg

30 35 40

TAC CCC TAC CGT CCA GAT ACC ATC ACC CAC GGG CTC ATG GCT GGG 182 
Tyr Pro Tyr Arg Pro Asp Thr Ile Thr His Gly Leu Met Ala Gly

45 50 55 

GTC ACC ATC ACG GCC ACC GTC ATC CTT GTC TCG GCC GGG GAA GCC 227 
Val Thr Ile Thr Ala Thr Val Ile Leu Val Ser Ala Gly Glu Ala

60 65 70

TAC CTG GTG TAC ACA GAC CGG CTC TAT TCT CGC TCG GAC TTC AAC 272 
Tyr Leu Val Tyr Thr Asp Arg Leu Tyr Ser Arg Ser Asp Phe Asn

75 80 85

AAC TAC GTG GCT GCT GTA TAC AAG GTG CTG GGG ACC TTC CTG TTT 317 
Asn Tyr Val Ala Ala Val Tyr Lys Val Leu Gly Thr Phe Leu Phe 

90 95 100 

GGG GCT GCC GTG AGC CAG TCT CTG ACA GAC CTG GCC AAG TAC ATG 362 
Gly Ala Ala Val Ser Gln Ser Leu Thr Asp Leu Ala Lys Tyr Met
105 110 115

ATT GGG CGT CTG AAG CCC AAC TTC CTA GCC GTC TGC GAC CCC GAC 407 
Ile Gly Arg Leu Lys Pro Asn Phe Leu Ala Val Cys Asp Pro Asp
120 125 130

TGG AGC CGG GTC AAC TGC TCG GTC TAT GTG CAG CTG GAG AAG GTG 452
Trp Ser Arg Val Asn Cys Ser Val Tyr Val Gln Leu Glu Lys Val
135 140 145

TGC AGG GGA AAC CCT GCT GAT GTC ACC GAG GCC AGG TTG TCT TTC 497 
Cys Arg Gly Asn Pro Ala Asp Val Thr Glu Ala Arg Leu Ser Phe 
150 155 160

TAC TCG GGA CAC TCT TCC TTT GGG ATG TAC TGC ATG GTG TTC TTG 542 
Tyr Ser Gly His Ser Ser Phe Gly Met Tyr Cys Met Val Phe Leu
165 170 175

GCG CTG TAT GTG CAG GCA CGA CTC TGT TGG AAG TGG GCA CGG CTG 587 
Ala Leu Tyr Val Gln Ala Arg Leu Cys Trp Lys Trp Ala Arg Leu
180 185 190

CTG CGA CCC ACA GTC CAG TTC TTC CTG GTG GCC TTT GCC CTC TAC 632 
Leu Arg Pro Thr Val Gln Phe Phe Leu Val Ala Phe Ala Leu Tyr
195 200 205 

GTG GGC TAC ACC CGC GTG TCT GAT TAC AAA CAC CAC TGG AGC GAT 677 
Val Gly Tyr Thr Arg Val Ser Asp Tyr Lys His His Trp Ser Asp 
210 215 220

GTC CTT GTT GGC CTC CTG CAG GGG GCA CTG GTG GCT GCC CTC ACT 722 
Val Leu Val Gly Leu Leu Gln Gly Ala Leu Val Ala Ala Leu Thr
225 230 235

GTC TGC TAC ATC TCA GAC TTC TTC AAA GCC CGA CCC CCA CAG CAC 767
Val Cys Tyr Ile Ser Asp Phe Phe Lys Ala Arg Pro Pro Gln His
240 245 250

TGT CTG AAG GAG GAG GAG CTG GAA CGG AAG CCC AGC CTG TCA CTG 812 
Cys Leu Lys Glu Glu Glu Leu Glu Arg Lys Pro Ser Leu Ser Leu
255 260 265 

ACG TTG ACC CTG GGG CGA GGC TGA CCACAACCACTTATGGGATACCCGCACT 864 
Thr Leu Thr Leu Gly Arg Gly *** 
270 275 
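
The paired rows of Fig. 4A show each codon of the open reading frame above the residue it encodes. The following is a minimal Python sketch of that correspondence for the first coding row only and is not part of the original disclosure; `CODON_TABLE` is a deliberately partial excerpt of the standard genetic code, and the names `CODON_TABLE`, `translate` and `first_row` are illustrative assumptions.

```python
# Partial standard genetic code: only the codons present in the first
# coding row of Fig. 4A (beginning at the ATG start codon) are included.
CODON_TABLE = {
    "ATG": "Met", "CAG": "Gln", "CGG": "Arg", "AGG": "Arg",
    "TGG": "Trp", "GTC": "Val", "TTC": "Phe", "GTG": "Val",
    "CTG": "Leu", "CTC": "Leu", "GAC": "Asp", "TGC": "Cys",
}


def translate(codons):
    """Translate a space-separated codon string into three-letter residues.

    Raises KeyError for any codon missing from the partial table above.
    """
    return " ".join(CODON_TABLE[codon] for codon in codons.split())


# First coding row of Fig. 4A, omitting the leading untranslated ACC.
first_row = "ATG CAG CGG AGG TGG GTC TTC GTG CTG CTC GAC GTG CTG TGC"
print(translate(first_row))
```

The printed residues correspond to the first fourteen residues of SEQ ID NO: 8 as shown beneath that row.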
Fig. 4B

Fig. 6

Fig. 7

Fig. 8

Fig. 9